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Abstract 


The main topic of this present thesis is the study of the asymptotic be¬ 
haviour of sequences modulo 1. 

In particular, by using ergodic and dynamical methods, a new insight to 
problems concerning the asymptotic behaviour of multidimensional sequences 
can be given, and a criterion to construct new multidimensional uniformly 
distributed sequences is provided. 

More precisely, one considers a uniformly distributed sequence (u.d.) 
(x n ) ng ^ in the unit interval, i.e. a sequence satisfying the relation 

1 N r 1 

T m oo n £ = l f^ dx • 

71=1 


for every continuous function / defined on [0,1]. 

This relation suggests the possibility of a numerical approximation of the 
integral on the right-hand side by means of u.d. sequences, even if it does 
not give any information on the quality of the estimator. 

The following quantity 


D* n = D* n (x x n ) 


sup 

ae[0,i[ 


£ 


N 

77=1 


1[0 ,a[( x n) 

~N 


A([0,a[) , 


called the star discrepancy , has been introduced in order to have a quanti¬ 
tative insight on the rate of convergence. 

One of the most important results about the integration error of this approx¬ 
imation technique is given by the Koksma-Hlawka inequality which states 
that the error can be bounded by the product of the variation of / (in the 
sense of Hardy and Krause), denoted by V (/), and the star-discrepancy D* N 
of the point sequence (x n ) ng N: 

1 N 
n —1 

Thus in order to minimize the integration error we have to use point se¬ 
quences with small discrepancy, that is, sequences which achieve a star- 
discrepancy of order 0(IV -1 (log N)). These sequences are called low-discrepancy 


f(x)dx 


< VU)D* N {X T 



sequences and they turn out to be very useful especially for the approxima¬ 
tion of multidimensional integrals. In this context, the error in the ap¬ 
proximation is smaller than the probabilistic one of the standard Monte 
Carlo method, where a sequence of random points instead of determinis¬ 
tic points, is used. Methods using low-discrepancy sequences, often called 
quasi-random sequences, are called Quasi-Monte Carlo methods (QMC). 

However, to construct low-discrepancy sequences, especially multidimen¬ 
sional ones, and to compute the discrepancy of a given sequence are in gen¬ 
eral not easy tasks. 


The aim of this thesis is to provide a full description of these problems 
as well as the methods used to handle them. The idea was to consider tools 
from ergodic theory in order to produce new low-discrepancy sequences. The 
starting point to do this, is to look at the orbit, i.e. the sequence of iterates, 
of a continuous transformation T defined on the unit interval which has the 
property of being uniquely ergodic. 

The unique ergodicity of the transformation has the following consequence: 


If T : [0,1] —> [0,1] is uniquely ergodic, then for every / £ T x (X) 

1 iV_1 r 

lim Y /( T ” x ) = / f( x )M x ) , 

iV-»oo iv z ' / v 
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for every x £ X. 

So the orbit of x under T is a uniformly distributed sequence. 

In this respect, we devoted the first chapter entirely on classical topics 
in uniform distribution theory and ergodic theory. 

This provides the basic requirements for a complete understanding of the 
following chapters, even to a reader who is not familiar with the subject. 

Chapter 2 deals with a countable family of low-discrepancy sequences, 
namely the LS'-sequences of points. In particular, one of these sequences 
will be considered in full detail in Chapter 3. 

The content of Chapters 3, 4 and 5 is based on three published papers 
that I co-authored. 

In Chapter 3, the method used to construct the transformation T is the 
so-called “cutting-stacking” technique. In particular, we were able to prove 
the ergodicity of T (a weaker property than unique ergodicity), and that 
the orbit of the origin under this map coincides with an LS’-sequence which 



turns out to be a low-discrepancy one. 


In Chapter 4, another approach from ergodic theory is used. This ap¬ 
proach is based on the study of dynamical systems arising from numeration 
systems defined by linear recurrences. In this way we could not only prove 
that the transformation T defined in Chapter 3 is uniquely ergodic, but we 
could also construct multidimensional uniformly distributed sequences. 

Finally, we go back to the problem of finding an approximation for inte¬ 
grals. In Chapter 5 we tried to find bounds for integrals of two-dimensional, 
piecewise constant functions with respect to copulas. Copulas are func¬ 
tions that can be viewed as asymptotic distribution functions with uniform 
margins. To solve this problem, we had to draw a connection to linear as¬ 
signment problems, which can be solved efficiently in polynomial time. The 
approximation technique was applied to problems in financial mathematics 
and uniform distribution theory. 
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Chapter 1 

Preliminary topics 


In this chapter we collect classical results about uniform distribution, dis¬ 
crepancy and ergodic theory, which are the essential background for the 
following chapters. 

1.1 Uniform distribution and discrepancy of se¬ 
quences 

In this first section we recall definitions and basic properties of uniform dis¬ 
tribution and discrepancy of sequences. For a full treatment of the subject 
and for proofs we refer to [221 GSJ ES ES]. 

The structure of the section is the following: a first part is devoted to the 
definition of uniformly distributed (u.d.) sequences of points in the unit 
interval. 

Furthermore we provide some historical examples constructed in [0,1[, such 
as Kronecker and van der Cor put sequences. 

Then we introduce the notion of uniform distribution and discrepancy for 
sequences of partitions defined on [0,1 [, showing how to get associated uni¬ 
formly distributed sequences of points from them. 

Finally we extend the definitions and the results to the multidimensional 
unit cube [0, l[ s and we see how this theory can be applied to Quasi-Monte 
Carlo methods to find numerical approximation of integrals. 

1.1.1 Preliminary definitions and results in [0,1 [ 

The starting point of uniform distribution is given by the following definition 
introduced by H. Weyl [104 , 1105] . 


Definition 1.1.1. A sequence (x n )neN of points in [0, 1 [ is said to be uni- 
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formly distributed (u.d.) if for every interval [a,b[ C [0,1[ we have 

1 N 


1 $% 0 NT, 1 l*d x ”) = b-a 


( 1 . 1 ) 


n =1 


where l[ a ,6[ the characteristic function of[a,b[. 


Observing that A([a, &[) = fj l[ a ,&[(x) dx, (1.1) can be written in the 
equivalent form 


n= 1 

This remark together with a classical approximation process of continuous 
functions by means of step functions led to the following criterion that can 
be also found in [1041 105] . 

Theorem 1.1.2 (Weyl’s Theorem). A sequence (x n ) ne pj of points in [0,1 [ 
is u. d. if and only if for every real-valued continuous function f defined on 
[ 0 , 1 ] 

1 N 

J im = 

TV—>-oo TV ^^ 
n —1 

holds. 




As it will appear clear at the end of the chapter, Theorem 1.1.2 suggests 
a numerical approximation of integrals by means of averages over uniformly 
distributed sequences. 


Two more general results can be found as corollaries of the previous 
theorem. The first one is a generalization to Riemann-integrable functions, 
while the second one to complex-valued continuous functions. 


Corollary 1.1.3. A sequence (x' n ) ne pj of points in [0,1 [ is u.d. if and only 
if for every Riemann-integrable function f defined on [0,1] equation (1.3) 
holds. 


Corollary 1.1.4. A sequence (x n )neN of points in [0,1 [ is u.d. if and only 
if for every complex-valued continuous function f defined on M with period 
1 equation (1.3) holds. 


This second corollary is particularly important since funtions of the type 
f{x) = e 2mhx , where h is a non-zero integer, satisfy its assumptions. Hence, 
these functions give a criterion to determine if a sequence of points is u.d.. 
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Theorem 1.1.5 (Weyl’s Criterion). A sequence (x n ) ne pj of points in [0,1 [ 
is u. d. if and only if 


lim — V e 2nihXn = 0 
N— >OG N — J 
n =1 


(1.4) 


for every integer h / 0. 


The first proof of this theorem has been given by Weyl jl04 , l()5 r but 
several proofs can be found in literature. As we will see later, Weyl applied 
this theorem to show that Kronecker’s sequence ({n0}) ng ^, where 6 is ir¬ 
rational and {x} is the fractional part of x, is u.d. in [0,1[. This criterion 
is of particular interest since it emphasizes the connection between uniform 
distribution theory and the estimation of exponential sums. 


Now we would like to introduce a quantity, the discrepancy, which mea¬ 
sures the maximal deviation between the empirical distribution of a sequence 
and the uniform distribution. This term has been introduced by van der Cor- 
put and the first intensive study of discrepancy is due to van der Corput 
and Pisot m- 

For a detailed survey about discrepancy of sequences and applications we 
refer to [29] , 

Definition 1.1.6 (Discrepancy). Letu = {x\,... , xat} be a finite set of real 
numbers in [0,1[. The quantity 


Dn(u) 


sup 

0<a<6<! 


77 . = 1 


a) 


(1.5) 


is called the discrepancy of the given set u. 

If (x n ) ne N is an infinite sequence of points in [0,1[, we associate to it the 
sequence of positive real numbers Dn({xi,X 2 , ■ ■ ■ ,xn})- So, the notation 
Dn{%u) denotes the discrepancy of the initial segment {x\,X 2 , ■ ■ ■ ,xat} of 
the infinite sequence. 

The following result shows how discrepancy is related to uniform distribu¬ 
tion. 


Theorem 1.1.7. A sequence (x n )neN of points in [0, 1 [ is u.d. if and only 

if 

lim Dn(x„) = 0 . (1.6) 

TV—>-oo 

Sometimes it is useful to restrict the family of intervals considered in the 
definition of discrepancy to intervals of the form [0, a[ with 0 < a < 1. This 
leads to the following definition of star-discrepancy. 
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Definition 1.1.8 (Star discrepancy). Let ui = {x\, ...,aqv} be a finite set 
of real numbers in [0,1[. We define star discrepancy of oj the quantity 


D* n ( u) 


sup 

0<a<l 



n=1 


l[0,a[(®n) 



(1.7) 


This definition extends to an infinite sequence in the same way as it has 
been done for the discrepancy. 

Moreover, discrepancy and star discrepancy are related by the following 
relation. 


Theorem 1.1.9. For any sequence (, x n ) n of points in [0,1 [ we have that 

D* N (x n ) < D N (x n ) < 2 D* N (x n ) . (1.8) 

As a consequence, it follows that Theorem 1.1.7| holds also for star dis¬ 
crepancy, i.e. lini 7 v->-oo D* N (x n ) = 0 if and only if (x n ) n ^ is u.ch. 


The most well-known problem in the theory of irregularities of distribu¬ 
tions is to determine the optimal lower bound for the discrepancy D^{x n ). 
Discrepancy bounds from below are essentially quantitative measures for the 
irregularity of point distributions. 

A very easy observation gives the following trivial lower bound. 


Proposition 1.1.10. For any finite point set u 
have that 


1 

N 


< Dn(lo) < 1 . 


{x\,... ,xn} in [0,1[ we 


The finite sequence (x n ) = (Jf) with n = 0,..., N — 1 satisfies D]y(x n ) = 
jy . But sequences of this kind, i.e. showing a discrepancy of this order, can 
only exist in the one-dimensional case, as it will be clear in §1.1.3 as direct 
consequence of Roth’s Theorem (Theorem 1.1.51). 

According to the previous Proposition, one could ask whether the lower 
bound is really attained by an infinite sequence in the unit interval. Van 
der Corput conjectured that such a sequence does not exist, i.e. there is no 
real sequence (x n )neN satisfying D]\r(x n ) = O (^) as N —> oo. 

Although this problem was first solved by van Aardenne-Ehrenfest [941 95], 
the conjecture was completely proved from a quantitative point of view as 
shown in the following important result due to Schmidt [ 85] . 


Theorem 1.1.11 (Schmidt’s Theorem). For any sequence (x n )neN in [0,1[ 
we have 

log; N 

D n ( x n ) > C 

for infinitely many N , where C > 0 is an absolute constant. 
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Sequences having discrepancy of the order O y ff* j are called low dis¬ 
crepancy sequences or quasi random sequences. As we will see in the last 
section, they play an important role in applications and they are especially 
used in the implementation of Quasi-Monte Carlo methods. 

In particular, when dealing with the implementation of an algorithm, one 
actually considers point sets instead of infinite sequences. We now state two 
results which turn out to be useful for estimating the discrepancy of a given 
point set. 

Theorem 1.1.12. If 0 < x\ < X 2 < • • • < xn < 1, then 

D N (X!,..., x n ) = + max (- min (- x n ) . 

N l<n<N \N ) l<n<N \N ) 

Theorem 1.1.13. Let u = {xq,... ,xj\r} be a finite set of N points in [0,1[. 
For 1 < j < r let Uj be a subset of u consisting of Nj elements such that its 
discrepancy is Djy^ujj), its star discrepancy is D* N .(ujj), with t Oj n cu* = 0 
for all j / i and N = N\ + ... + N r . Then 

Dn(v) < ^2 y / ' ) .Y;(-'/) 

3 =i 

and also 

r N- 

D*n( u ) ^ X/ 

3 =i 

Now we would like to show two classical examples of low discrepancy 
sequences. 

The two sequences that we are going to present got considerable attention 
since their introduction at the beginning of 1900. 

Example 1.1.14 (Kronecker sequence). Let 9 be an irrational number. 
Then the sequence ({n#}) ng pj is known as Kronecker sequence. It is an 
easy application of Weyl’s criterion to show that it is u.d.. 

In fact, 

]_ — ^ e 27 r *^)A r + 1 

N \ 1 — e 2 7ri/i0 


1 

^ _ g2nih6(N+l) _ ^ g27r ih6 

l 

2n ihO 

^ _ g 2nihN6 

N 

^ _ ^lirihO 

“ N 

o 

^ g27 xihO 




1 

N 


^ g 2nihN6 

^ g27 HhO 
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Now if we let a = irhd, we can observe that 


11 — e 


= |1 — cos(2a) — zsin(2a)| = \J (1 — cos(2ct)) 2 + sin 2 (2a) 

= \/2(l — cos(2a)) = \J 2(1 — cos 2 a + sin 2 a) = 2\ sii 

This is enough to see that e§ holds since 


sm a 


N 

£■ 


1 

N > 6 " 

n=l 


,27 rifinO 


1 

^ g 2nihN6 

N 

2 sin(7r h9) 


< 


i 


N\ sin(7r/i0)| 

which tends to zero when N tends to infinity for an integer h 0. 


This result has been found independently by Bohl m , Sierpihski m 
and Weyl \ 705\/ in 1909-1910. 

The sequence takes the name of Kronecker since this results refines a the¬ 
orem due to Kronecker showing that the points e md are dense in the unit 
circle, whenever 0 is an irrational multiple of n (Kronecker’s approximation 
theorem). 

The following result shows that bounds for the discrepancy of the Kronecker 
sequence are strictly connected with the partial quotients of the continued 
fraction expansion of 9 and therefore with the Ostrowski expansion. The 
proof requires some basics about continued fractions and the related Os¬ 
trowski expansion and so we do not provide it. We refer to \6(fi for a proof. 

Theorem 1.1.15. The discrepancy -Djv(x n ) of the Kronecker sequence sat¬ 
isfies 

^ m ^ m+1 

D N (x n ) < — ^(c fc + i) < — 5^ Oi , (1.9) 

k =0 k =1 

c k ^o 

where the at’s are the partial quotiens of the continued fraction expansion of 
9 and the c n ’s are the integers in the Ostrowski expansion of N. 

Corollary 1.1.16. If 9 is an irrational number such that Y2k‘=i a k 
then 

D N (x n ) = O ( 1 ° g ^ /V ^ for all N > 2 

Example 1.1.17 (Van der Corput sequence). The van der Corput sequence 
has been introduced by van der Corput m/ in 1935. The main idea to 
provide bounds for the discrepancy of this sequence is to introduce a suitable 
numeration system, as the Ostrowski numeration system for the Kronecker 
sequence. There is one natural numeration system arising from the definition 
of the van der Corput sequence, namely, the classical system of numeration 
in base b, where b > 2 is apositive integer. 


= 0(m), 
( 1 . 10 ) 
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Proposition 1.1.18 (6-ary expansion). Let b > 2 be a positive integer and 
denote by Zb = {0,1,... ,b — 1} the least residue system mod b. Then every 
positive integer n > 0 has a unique expansion 

r 

n = '^2a k (n)b k ( 1 . 11 ) 

k= i 


in base b, where a k (n) E Zb for all j > 0 and r = [log^nj, where |_£j is the 
largest integer not greater than x. 


Definition 1.1.19 (Radical inverse function). For an integer b > 2, con¬ 
sider the b-ary expansion in ( |1.11 ) of n E N. The function ipb '■ N —> [0,1[ 
defined as 


<Pb(n) = ^ o fc (n)6 


-fc-i 


( 1 . 12 ) 


k =0 


is called radical inverse function in base b. 


In other words, <p>b( n ) is obtained from n by a symmetric reflection of 
the expansion (1.11) with respect to the decimal point. 


Definition 1.1.20. The sequence ( x n ) n eN of general term x n = (n) is 

called van der Corput sequence. 


Remark 1.1.21. It is common use to call van der Corput sequences in base 
b, sequences (x n ) ne pj of general term x n = </?b(n), where b is a fixed prime 
number greater than 1. 


Theorem 1.1.22. The discrepancy D]y(x n ) of the van der Corput sequence 
{Tb(n)) n( z N satisfies 


D N (x n ) < c b 


^ log(AT -I" 1) ^ 


where Cb > 0 is an absolute constant. 

In particular, Faure fffH Theorem 6] established the following bounds for 
arbitrary b > 2: 

limcup iVjPAr( ^ (n)) ~ f 4(6+i1io gb for even 6 
N^-oo log N \ 4 for odd 6 . 

^4n immediate consequence of this result is the fact that the van der Corput 
sequence which has the best asymptotic behaviour is the one in base 3. 
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Another generalization of the van der Corput sequence, called generalized 
van der Corput sequence in base b, which improves the asymptotic behaviour 
of the discrepancy, has been introduced in m ■ This new family of sequences 
is defined as the sequence of general term 

m 

x n = a{ak{n))b~ k l for all n > 0, 
k =o 

where a is a permutation of Z 5 . 

In particular, if we denote by (ipf(n)) n ^ a generalized van der Corput se¬ 
quence in base b, the following result says that the discrepancy is smaller 
compared to that of the classical van der Corput sequence ( <p b (n )) in base b. 

Theorem 1.1.23 (Corollaire 3, [31]). For every sequence tpf we have 
D N(Tbi n )) < D N(Tb( n )) < D N (<p b (n)) 


1.1.2 Uniform distribution of sequences of partitions of [0,1[ 

In this section we consider a method to obtain u.d. sequences of points 
in the unit interval as sequences of points associated to a special family of 
sequences of partitions. We present new sequences of partitions recently 
introduced and studied by Volcic m , which are a generalization of the 
sequences introduced by Kakutani [53] in 1976. 

Definition 1.1.24. Let (it n ) n eN be a sequence of partitions 0 /[0,1[, with 


1 [ : 1 < i < k(n)} . 


Then ( 7r n ) n is uniformly distributed (u.d.) if for any continuous function 
f on [ 0 , 1 ] 


lim 

n —>00 



f(t) dt . 


(1.13) 


Remark 1.1.25. Given a sequence of partitions ( 7 r n ) ne pj, it is possible to 
associate to it a sequence of measures (/r n )neN by 


f^n — 


1 

k(n) 


k(n) 



2—1 


(1.14) 


where we denote by 5t the Dirac measure concentrated at t. 

We note that the uniform distribution property of the sequence of partitions 
(TTn)neN is equivalent to the weak convergence of (/r n )neN t° the Lebesgue 
measure A on [0,1]. 


This observation allows us to give the following equivalent definition. 
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Definition 1.1.26. A sequence of partitions (n n ) ne ^ is u.d. if and only if 
for every interval [a, b] C [0,1] 


kin) 



(1.15) 


Equivalently, (7r n ) n( =N is u.d. if the sequence of discrepancies 



(1.16) 


tends to 0 as n —> oo. 

Let us describe Kakutani’s splitting procedure [53], known as Kakutani’s 
a-refinement, which allows to construct a whole class of u.d. sequences of 
partitions of [0,1[. 

Definition 1.1.27 (Kakutani splitting procedure). Let a 6 ]0,1[ and vr = 
{ [t{ , ti + 1 [: 1 < i < k} be any partition of [0,1[, then Kakutani’s a-refinement 
of n (which will be denoted by an) is obtained by splitting only the intervals 
of n having maximal length in two parts, proportionally to a and 1 — a 
respectively. 

If we denote by a n 7r the a-refinement of a n ~ 1 n for every n £ N, the 
so-called Kakutani’s sequence of partitions (ft n ) n eN = (« Tl w)neN is obtained 
by successive a-refinements of the trivial partition u: = {[0,1]}. 

Observe that for a = ( Kakutani’s sequence of partitions coincides with 
the binary sequence of partitions. 

Kakutani proved the following result. 

Theorem 1.1.28. For every a e]0, 1[ the sequence of partitions (K n ) n eN is 
u.d.. 

The sequence (a n u;) rie i^ and its properties have been investigated by 
many authors. For example, see m and [98] for a modification of the 
splitting procedure where the intervals of maximal length are split randomly. 
For a generalization of Kakutani’s splitting procedure in higher dimensions 
see Carbone and Volcic [22]. Another further generalization of Kakutani’s 
splitting procedure, that we now describe, was introduced by Volcic m- 

Definition 1.1.29 (p-refinement). Let p be a non-trivial finite partition of 
[0,1 [. The p-refinement of a partition n of [0,1 [ (denoted by pn) is the 
partition obtained by subdividing all intervals of n having maximal length 
positively homotetically to p. 
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Remark 1.1.30. If p = {[0, a\, [a, 1 [} and it = ui, then its p-refinement 
coincides with Kakutani’s a-refinement. 

As in the case of Kakutani’s splitting procedure, we can consider the p- 
refinement p 2 ir of pn and so on in order to obtain the sequence of partitions 
(p n n)neN defined as follows. 

Definition 1.1.31. Given a non-trivial finite partition p of [0,1[. the se¬ 
quence of p-refinements ( p n ir ) n eN of it is defined as the sequence of partitions 
obtained by successive p-refinements of it. 

By using arguments from ergodic theory, Volcic m proved that the 
sequence ( p n iw) n eN is u.d. for every finite partition p. 


A natural problem is to estimate the asymptotic behaviour of the dis¬ 
crepancy of (p n 7r) neN . 

A first result in this direction has been obtained by Carbone m, who con¬ 
sidered the so-called LS'-sequences for p consisting of L intervals of length 
a and S intervals of length a 2 , such that aL + a 2 S = 1. 

We will provide a complete description of this class of sequences in Chapter 

2 . 

The first general results providing upper bounds for the discrepancy of se¬ 
quences of arbitrary p-refinements of u have been found by Drmota and 
Infusino [28| . The authors consider a new approach based on the analysis of 
a tree evolution process, namely the Khodak algorithm, where the genera¬ 
tion of successive nodes has the same structure as the p-refinement process. 
Let us state the results found in [28] , 


Definition 1.1.32. Let p\,...,p m be positive integers. We say that 
log ,..., log are nationally related if there exists a positive number 

A such that log ( J ,..., log ( ^ J are integer multiples of A, that is 


log 



njA , with nj E Z for j = 1 ,... ,m 


Without loss of generality, we can assume that A is as large as possible which 
is equivalent to assume that gcd(ni,..., n m ) = 1. 

We say that log ,... ,log are irrationally related if they are not 

rationally related. 


Theorem 1.1.33. Suppose that the lengths of the intervals of a partition 
p of [0,1[ are p\,... ,p m and that log j for j = 1,..., m are rationally 
related. Then there exist a real number rj > 0 and an integer d > 0 such 
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that the discrepancy D n of (p n u;)rceN is bounded by 


D 


n 


Vo ( (log fc(n.)) d \ 

W l k(n)v J 
n f (logA.-p-i)) d + 1 \ 
^ V k(n)v ) 



if 0 < Tj < 1 , 

if n = 1 , 
if ?7 > 1 . 


The next theorem requires the following 


Definition 1.1.34. Every irrational number x has an approximation con¬ 
stant c(x) defined by 

c(x) = lirninf q\qx — p\ , 

#—>■ OO 

where p is the nearest integer to qx. 

Moreover x is said to be badly approximable if c(x) > 0. 


Theorem 1.1.35. Suppose that the lengths of the intervals of a partition p 
of [0,1[ are p and q = 1 — p. If ^ Q is badly approximable, then the 
discrepancy D n of (p n u) is bounded by 


(log log(fc(n))\ 4 


D n = 0 

\\ log {k(n)) 
Furthermore, ifp,q are algebraic numbers then 
D = 0 floglog(k(n))' 


n — y oo . 


i i /, / xx ; ; > n —•> OO) 

A log (k(n)) J J 

where k is an effectively computable positive real constant. 


In this second case, we can observe that weaker upper bounds for the 
discrepancy are obtained, since they depend heavily on Diophantine approx¬ 
imation properties of the ratio j^jL2. 

Finally, the authors proved bounds for the elementary discrepancy of the 
sequences of partitions constructed for a certain class of fractals. 


A second interesting problem arising in this context concerns the uniform 
distribution of sequences of partitions, when the starting partition n is not 
the trivial partition oj. 

As pointed out in [101] . if we start from an arbitrary initial partition vr, then 
the sequence of partitions ( / o n 7r) rig N is in general not u.d.. 

Remark 1.1.36. Let us consider 


f 

2 


2 

n r 

1 


o, - 


-,1 

> and p = < 

o, - 

X 

5 


5’ 

11 X 
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It is clear that the p—refinement operates alternatively on [|, l[ and [0, | [. 
So, if we consider the sequence of measures (p n )n£N associated to (p n ir) n£ f$, 
then for any measurable set E C [0,1 [ the subsequence (^2n)n.eN converges 
to 


A 2n(E) 


5 

4 


( fin 

2 

0 , - 

) +^-A 

'Bn 

2 

-,1 

V 

5 

u 6 


5’ 


while the subsequence (/X2n+i)neN converges to 


^2n+l{E) 


5 , ( 

2 

io ( 

2 

-•A Bn 

0 ,- 


-,1 

6 V 

5 

[) 9 V 

5 


Hence, (^ n )neN does not converge and consequently (p n 7 r) ne pj is not u.d.. 

It is worthwhile noticing that the same problem arises even in the simplest 
case of Kakutani’s splitting procedure. 

So, it is essential to find sufficient conditions on it in order to guarantee the 
uniform distribution of (a 71 vr) ne p} or more in general of ( p n Tr) n eN- 


The solution to this problem has been found by Aistleitner and Hofer 
[2], who gave necessary and sufficient conditions on ir and p under which 
the sequence (p n n) n ^ is uniformly distributed. 

The authors proved the following result, using methods introduced in [28]. 

Theorem 1.1.37. Let p be a partition of [0,1[ consisting of m > 2 intervals 
of lengths pi, ■ ■ ■ ,p m , an d let ir be an initial partition of [0,1 [ consisting 
of l > 2 intervals of lengths a \. Then the sequence (p n Tr) n£ ^ is 
uniformly distributed if and only if one of the following conditions is satisfied: 

1. the real numbers log ,..., log j are irrationally related 

2. the real numbers log ,..., log are rationally related with pa¬ 

rameter A and 


oti = ce ViA , c €E M + , Vi G Z, i = 1... I . 

Remark 1.1.38. Condition 2. includes the special case when the initial 
partition it consists of intervals having the same length, and in particular the 
case when the initial partition is the trivial partition uj and the corresponding 
sequence of partition is a Kakutani sequence. 

In particular, the following corollary gives conditions under which the 
Kakutani sequence of partitions of a particular initial sequence n is u.d.. 

Corollary 1.1.39. Let p = {[0,p[, [p, 1[} and n = {[0, a[, [a, 1[}. Then 
(p n ^)n£N Is u.d. if and only if one of the following conditions is satisfied: 

iog(p) 

log(l-p) 


1. 


is irrational 
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2. log and log are rationally related with parameter A and 

« = ~ekhp[ f or fc6Z ' 

Furthermore, in |2j a description of the general asymptotic behaviour of 
a sequence of partitions, even in cases not covered by Theorem 1.1.37, is 
given. More precisely, the authors proved the following 

Theorem 1.1.40. Assume that neither condition 1. nor condition 2. of 
Theorem 1.1.37 is satisfied. Then for any interval A = [a,b] C [0,1] which 


is completely contained in the i-th interval of the initial partition n for some 
i, 1 < i < l, we have 


1 fc(n) 

HmSUp kUA 2 hrb}^) = Clip ~ a) , 

n—>-oc K[rl) ' 

J=1 

^ fc(n) 

liminf —— V 1 [ a ,b\(tj) = c 2 {b - a) , 
n—>oc fc[n) z ' 1 J J 

where 

^ = fe (a { log X 106 

are constants depending on i. 

Hence, if either condition 1. or 2. does not hold, then the sequence 
( / o n 7r) neN is not u.d.. 

Now we would like to discuss how to associate to a u.d. sequence of parti¬ 
tions a u.d. sequence of points. 

The problem is the following: Assume that (7r n ) ne pj is a u.d. sequence of 
partitions of [0,1[. Is it possible to rearrange the points tf determining the 
partitions n n , for 1 < i < k(n), in order to get a u.d. sequence of points? 
Clearly, we can reorder points in several different ways and a natural restric¬ 
tion is that we first reorder all the points determining 7Ti then those defining 
7T2, and so on. A reordering of this kind is called sequential reordering. 

We introduce a result proved by Volcic in Em. where a probabilistic answer 
to this problem is given. 

Definition 1.1.41. If (ir n )n£N is a u.d. sequence of partitions of [ 0,1 [ with 

= 
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the sequential random reordering of the points If is a sequence N 

of consecutive blocks of random variables. The n-th block consists of k{n) 
random variables which have the same law and represent the drawing, with¬ 
out replacement, from the sample space W n = jt”,..., ^( n )| where each 
singleton has probability . 

Denote by S n the set of all permutations on W n , endowed with the 
probability P(r n ) = with r n £ S n . 

Any sequential random reordering of (7r n ) ne pj corresponds to a random 
selection of r n £ S n for each n £ N. The permutation r n £ S n identifies 
the reordered fc(n)-tuple of random variables ipi with K{n — 1) < i < K(n), 

n 

where K(n ) = Y k(i). Therefore, the set of all sequential random reorder- 
i= 1 

ings can be endowed with the natural product probability on the space 

OO 

S = U S n . 

n=l 

Theorem 1.1.42. If (7r n ) ne pj is a u.d. sequence of partitions of [ 0,1[, then 
the sequential random reordering of the points If defining them is almost 
surely a u.d. sequence of points in [0,1[. 

1.1.3 Multidimensional case 

In this section we collect results about uniform distribution and discrepancy 
in the multidimensional case. We will see that some of the results stated 
in the first subsection can be naturally extended in the s-dimensional unit 
cube [0, l[ s , while some others require a deeper analysis, as in the case of 
the estimation of the discrepancy. 

In fact, apart from the case s = 1, it is in general very difficult to give 
good quantitative estimates concerning the distribution of multidimensional 
sequences and several problems are still open. 

Let s be an integer with s > 2. We use the following notation: for 
two points a, b £ [0,1 [ s we write a < b and a < b if the corresponding 
inequalities hold in each coordinate; furthermore, we write [a, b[ for the set 
{x £ [0, l[ s : a < x < b}, and we call such a set an s-dimensional interval. 
Moreover we denote by 1 j the indicator function of the set J C [0, l[ s and 
by As the s-dimensional Lebesgue measure (for short we write A instead of 
Ai). Note that vectors will be written in bold fonts and we write 0 for the 
s-dimensional vector (0,..., 0). 


Definition 1.1.43. A sequence (x„) ne pj of points in [0,1 [ s is called uni- 
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formly distributed (u.d.) if 


1 


N 


} im T7 1 [a,b[(Xn) = A s ([a, b[) 


TV—>-oo N 

n= 1 

for all s-dimensional intervals [a, b[C [0,l) s . 

In particular, we can introduce the following function 

1 N 

g(x,y ) = ^m^ ^2 i[o,x[x[o >2 /[ ( x n,y n ), 


(1.17) 


(1.18) 


n=l 


which will be extensively used in the last chapter. We call g the asymptotic 
distribution function (a.d.f.) of a sequence ( x n ,y n ) ne pj in [0,1[ 2 if (1.18) 
holds for every point (x, y) of continuity of g. Moreover, in [33] Fialova and 
Strauch consider 


lim sup — 

N^-oo N 


N 

^2f(x n ,y n ), 

77 .— 1 


where (x n ) n >i, (y n )n>i are u.d. sequences in the unit interval and / is a con¬ 
tinuous function on [0,1[ 2 , see also [93]. In this case the a.d.f. g of (x n , y n )n >l 
is called copula. 


Weyl was the first who extended the uniform distribution property to 
the multidimensional case. His classical results [ 105 . 104] have the following 
multidimensional version. 

Theorem 1.1.44 (Weyl’s Theorem). A sequence (x n ) ne j<j of points in [0, l[ s 
is u.d. if and only if for every (real or complex-valued) continuous function 
f on [0, l[ s the relation 


1 


N 

lim — > 

TV—>■ oc N 

71 =' 


] /(Xn) = [ /(x)d? 

1 ^[0,1] 3 


holds. 

In particular, if g is a copula, then we can write 

i JV r r 

Jim -ttY ]f(x n ,y n )= / / f(x,y)dg(x,y). (1.19) 

AT^ciVZ-. j 0 j 0 

Let x,y eR s and let us denote by x-y the usual inner product in M s , i.e. 

S 

x-y = £ Xiyi. Then we can give the generalization of the Weyl’s Criterion 
2—1 

presented in the first section. 
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Theorem 1.1.45 (Weyl’s Criterion). The sequence (x n ) ng ^ in [0, l[ s is u.d. 
if and only if 

1 N 

lim — V e 2rfl - x " = 0 

N—>oo N 

71 = 1 

for all non-zero integer lattice points h £ Z s \ 0. 

As in the one-dimensional case, a measure for the quality of the uniform 
distribution of a sequence is given by the discrepancy. 

Definition 1.1.46 (Discrepancy). Let u> = {xi,... , xpv} be a finite set of 
points in [0, l[ s . Then the discrepancy of lo is defined as 


D n {u) 


sup 

Je[o,i[ s 


^EMXn)-A s(J) 

n=l 


( 1 . 20 ) 


If (x n ) n£ N is an infinite sequence of points, we associate to it the sequence 
of discrepancies Dn({x i, X 2 ,..., xpv}). So, the symbol -Dpv(x re ) denotes the 
discrepancy of the initial segment {xi, X 2 ,..., xpv} of the sequence. 

Theorem 1.1.47. A sequence (x n ) ng pj of points in [0, l[ s is u.d. if and only 

if 

lim D]\rfx. n ) = 0 . (1.21) 

N—>oo 

If we restrict to subintervals of the form J = [0, aq [x • • • X [0, a s [ with 0 < 
a* < 1, we get the obvious definition of multidimensional star discrepancy. 

Definition 1.1.48 (Star discrepancy). Let u = {xi,..., xpy} be a finite set 
of points in [0, l[ s . Then the star discrepancy of u> is defined as 


D* n (uj) = sup 
J 


^ X] M x n) - A S (J) 

n =1 


( 1 . 22 ) 


where the supremum is taken over all subintervals J C [0,1 [ s of the form 
J = [0, ai[x • • • x [0, a s [ with 0 < a* < 1. 

The relation between discrepancy Dn and star discrepancy D* N in several 
dimensions is very much similar to the one presented for s = 1. 

Theorem 1.1.49. For any sequence (x n ) n6 ^ of points in [0,1 [ s we have 


D* N (x n ) < Dn(^u) < 2 s D* N (x n ) . 


In higher dimensions it is possible to define other forms of discrepancies. 
One of the most used to quantify the convergence in (1.17) is the isotropic 
discrepancy, where the supremum is taken over all convex sets in the unit 
cube [0, l[ s (see e.g. [23] for estimates of the discrepancy of point sets with 
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respect to closed convex polygons). The origin of this new definition is 
strictly related to the integration domain in the right hand side of (1.17). 
We refer to 


for a complete introduction to geometric discrepancy, i.e. 
discrepancy studied for classes of geometric figures other than the axis- 
parallel rectangles, such as the set of all balls, or the set of all polygons or 
polytopes, and so on. 


In the sequel we will present known lower bounds for the discrepancy. 

Proposition 1.1.50. For any finite set u> = {xi,... ,xjy} of points in [0, l[ s 
we have that 

^ < D n (u>) < 1 . (1.23) 

As we have already pointed out, the lower bound is exactly attained only 
in the one-dimensional case. In fact, the following theorem, due to Roth [80], 
illustrates that in the higher-dimensional case examples of sequences having 
discrepancy equal to jj cannot exist. 

Theorem 1.1.51 (Roth’s Theorem). Let s > 2. Then the discrepancy 
Dn(uj) of the point set u> = {xi,..., x_v} C [0, l[ s is bounded from below by 

M > * , (1.24) 


where c s > 0 is an absolute constant given by c s = -. 

2 4 R0-l)log2)T- 

This bound is the best known result for s > 3 and for s = 3 it has been 
slightly sharpened by Beck [10] . 

The first finite sequence of N > 2 points in [0,1 [ s showing asymptotic be¬ 
haviour of the discrepancy of order O ^ ( log ^ has been the Hammersley 

point set [3], for which the constant depends only on the dimension s. 

We will show how to construct this point set in the next paragraph. 

A stronger form of Roth’s Theorem, but with a worse constant, is the fol¬ 
lowing theorem proved by Beck and Chen [Ill- 

Theorem 1.1.52. Let s > 2 and m = {xi,...,x^v} an arbitrary finite 
sequence of N > 1 points in M s . Then there exists an s-dimensional cube 
Q C [0, l[ s with sides parallel to the axes satisfying 


^X] 1 ( x i)-A S (Q) 


> 


c s (log N) s 2 -ds 
N 


C s = 


(log2)V2-3337r-2 
((s-l)!)bt(2s + l) £ f i (6s + l)i 


where 
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and 

d s = sh s+2 3^ . 

It is clear that Roth’s Theorem follows from this one and it is an inter¬ 
esting question whether the converse is also true. This has been proved only 
in the 2-dimensional case by Ruzsa [81], but in general only weaker results 
have been found. 


For infinite sequences in [0, l[ s we only have a long-standing conjecture, 
stating that for every dimension s there exists a constant c s such that for 
any infinite sequence (x n ) ng ^ in [0, l[ s 


Dn^X-ti) — C-s 


^ (logAQf j 


(1.25) 


holds, for infinitely many N and with a positive constant depending only on 
the dimension s. 

It is a widely held belief that the orders of magnitude in (1.24) and (1.25) 
are best possible even if this is only known for (1.24) in the case s = 1,2 
and for (1.25) in the case s = 1 (see Theorem 1.1.11). 


Sequences of points in [0, l[ s having discrepancy of order O ^ are 

called low discrepancy sequences. 

In the last part of this section we will highlight the important role played 
by low-discrepancy sequences in Quasi-Monte Carlo integration and other 
applications to numerical analysis. 


In the following paragraph we show some of the most well-known exam¬ 
ples of multidimensional low discrepancy sequences and point sets, namely 
Kronecker sequence, Hammersley point set and Halton sequence. The first 
one is the obvious generalization of the one-dimensional Kronecker sequence, 
while the second and the third are a finite and infinite generalization of the 
van der Corput sequence. 

Example 1.1.53 (Kronecker sequence). Let 9±,... ,9 S be distinct irrational 
numbers. The s-dimensional sequence 


(x n ) ngN = ({ n9i },..., {n0 s })neN 
is called Kronecker sequence. 

It has been proved by Weyl \10J^ that this sequence is uniformly distributed 
if and only if 1, 9 \,..., 9 S are linearly independent over Q. Indeed it is a 
direct application of WeyVs criterion (1.4) and the proof is exactly the same 
as in the one-dimensional case. 

Concerning the problem of estimating the discrepancy of this sequence, the 
approach used is completely different from that in the one-dimensional case, 
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since there is no canonical generalization of Ostrowski numeration system to 
higher dimensions. This is essetially due to the fact that there is no canon¬ 
ical generalization of the Euclidian algorithm. To cope with the lack of a 
satisfactory tool replacing continued fractions, several approaches are possi¬ 
ble, for instance a classical one is that of best simultaneous approximations, 
introduced by Lagarias [57 1 [58 ] in 1982. 

So almost every Kronecker sequence is an “almost” low-discrepancy se¬ 
quence. 

Example 1.1.54 (Hammersley point set). For a given N E N, an s- 
dimensional Hammersley point set (x„) of size N in [0, l[ s is defined by 

x n = W, • • - n = 0, 1,... ,N - 1 , 

where b\,... , b s - i are given coprime positive integers and is the radical 
inverse function in base bi defined in Definition]! . 1.19 


It is immediate to see that this is an s-dimensional generalization of the 
van der Corput sequence. It has been introduced by Hammersley in his 
survey paper on Monte Carlo methods in order to achieve a smaller error 
bound in the estimate of multidimensional integrals by means of finite sums. 
This problem will be discussed in detail in the next section. 


Halton [43[ proved that the Hammersley sequence has a discrepancy of 
order O ^ ( log ^ ^ . We will see in the next paragraph that this result di¬ 
rectly follows from the following lemma applied to the (s — l)-dimensional 
Halton sequence. 

The following lemma turns out to be a very useful result to contruct differ¬ 
ent low-discrepancy point sets in the s-dimensional unit cube, as soon as we 
know that the sequence (x n ) ne ^ is low-discrepancy in [0,1[® . 

Lemma 1.1.55. For s > 2, let u be an arbitrary sequence of points (x n ) ne ^ 
in [0, l[ s_1 . For N > 1, let P be the point set consisting of (-^,x n ) E [0, l[ s 
for n = 0, 1,... ,N — 1. Then 

ND* n (P) < max MD* m (u) + 1 . 

1 <M<N 

Remark 1.1.56. A Hammersley point set is a finite set of size N which 
cannot be extended to an infinite sequence. In fact, if we want to increase 
the number of points by one we have to compute again all the N point of the 
form jf, losing the previously calculated values. 

It is evident that this is a quite tedious work, especially for applications. 

So now we introduce the second generalization to higher dimension of the 
van der Corput sequence. This new construction gives rise to an infinite 
sequence of points in the unit cube [0, l[ s . 
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Example 1.1.57 (Halton sequence). Let b \,..., b s > 2 be pairwise coprime 
integers. Then the s— dimensional Halton sequence (x n ) ng f^ in [0,1 [ s is the 
sequence of points of general term 


x„ = (Thin), . . .,Tb s (n)) , 


where tp ^ is the radical inverse function in base bi defined in 1.1.19. 


For s = 1 and b\ = 2 we just get the van der Corput sequence. 

The following result is due to Halton and shows that the Halton sequence 
has low-discrepancy. 

Theorem 1.1.58. Let (x n ) ne pj be the Halton sequence in the pairwise rel¬ 
atively prime bases b\..... b s . Then 


1 


d *n«) < jf+x n 


i=l 


bj- 1 
2 log b, 


log N + 


bi + 1 


for all N > 1 . 


From this theorem and Lemma 1.1.55. we get the following result. 

Corollary 1.1.59. If P is the s-dimensional Hammersley point set of size 
N in I s , then 


D* n (P ) < 


s 

N 



bi - 1 
2 log bi 


log N + 




We end this paragraph with two important remarks concerning the dis¬ 
crepancy of the Halton sequences. 

Remark 1.1.60. Apart from the requirement of coprimality in the choice of 
the bases b ±,..., b s in the definition of the Halton sequence, it is also possible 
to optimize this choice in order to get a better discrepancy bound. 

In fact, as we have shown in Theorem 1.1.58, an upper bound for the dis¬ 
crepancy of the Halton sequence is given by 


D* N {*n) < A(b\,.. .,b s )N-\\ogN) s + 0(N-\logN) s - 1 ) 


where the coefficient of the main term is 


A(b u ...,b s )= n 

i=l 


bj - 1 
2 log bi 


So in order to get a better upper bound, we need to minimize this coefficient. 
It is evident that it is minimal if we let bi,, b s be the first s primes pi = 
2,p 2 = 3,... ,p s . 

Of course the same assumption can be made in the case of the Hammersley 
point set and it gives again a better bound for the discrepancy. 
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If we denote by A s the term A(p \,.... p s ) and consider the Prime Number 
Theorem, we obtain 


lim 

s—> oo 


log A s 

s log s 


= 1 . 


Thus A s increases superexponentially as s —> oo and this means that Hat¬ 
ton sequences and Hammersley point sets are actually useful for applications 
only for small dimensions s. 

This phenomenon, generically referred to as “curse of dimensionality”, es¬ 
sentially implies that when the dimension increases, the Halton sequence 
does not show a good asymptotic behaviour. 

To cope with this problem, it has been introduced, first by Spanier m and 
later considered by several authors (see e.g. m ® m w, the 

notion of hybrid sequences. Roughly speaking, they are s-dimensional se¬ 
quences obtained by concatenating d-dimensional low-discrepancy sequences 
with s — d-dimensional random sequences. We will discuss the advantages 
of considering these new sequences in the last part of this section. 


Remark 1.1.61. As we have already pointed out, choosing the first s primes 
instead of arbitrary coprime bases improves the bound for the discrepancy of 
the Halton sequence. Nevertheless, numerical calculations showed strong cor¬ 
relation between components. Correlations between radical inverse functions 
with different bases used for different dimensions reflect on poorly distributed 
two-dimensional projections. The first investigations on this problem are due 
to Braaten and Weller m 

For instance, if we consider the eight-dimensional Halton sequence, we know 
that the last two coordinates are defined by the digit espansion in base 17 and 
19, respectively. As the following picture taken from JS3SI shows, there is a 
strong correlation between the seventh and eighth coordinate. 


The poor two-dimensional projections can be explained by the fact that the 
difference between the two primes bases 17 and 19 corresponding to the di¬ 
mensions 7 and 8 is very small compared to the base size and so the first 18 
points in the two bases have the same expression. 

One could think that one way to avoid this problem is to drop the first 20 
entries, but this is not very convenient since the same problem appears also 
for bigger twin primes. In order to break correlations, the most convenient 
solution is given by a scramble of the original Halton sequence. It is a ran¬ 
domized version of the Halton sequence, in the same way as it has been done 
to obtain the generalized van der Corput sequence in one dimension, based 
on digit permutations. 

An alternative approach suggested in \2Iff is to find an optimal Halton se¬ 
quence within a family of scrambled sequences. 

Other methods are considered in literature to randomize the Halton sequence 
and they make use of the description of the sequence by means of the von 
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Figure 1.1: Projection of the first 100 points of the Halton sequence onto 
the seventh and the eighth coordinates 



Neumann-Kakutani transformation, see e.g. We will analyze 

this transformation and its connection with u.d. sequences in the next chap¬ 
ter. 

As discussed above, even if Halton sequences are low-discrepancy and there¬ 
fore suitable for applications, it is sometimes better to consider point sets 
and sequences whose discrepancy bounds have much smaller constants. 

In view of this consideration, point sets called (t,m, s)-nets and sequences 
called {t, s)-sequences have been introduced by Niederreiter fh5l /. 


1.1.4 Quasi-Monte Carlo methods 

We end this section by presenting the most interesting application of low- 
discrepancy sequences. 

Roughly speaking, quasi-Monte Carlo methods are deterministic versions of 
Monte Carlo methods. 

This last technique consists in the numerical approximation of integrals of 
a function / defined on the s-dimensional unit cube [0, l] s by the average 
function value at the N quadrature points belonging to [0, l[ s . More pre¬ 
cisely, the crude Monte Carlo approximation for the integral of a function 
/(x) on the unit interval [0, l] s is 



1 N 

71 = 1 


where xi,..., xjy are random points from [0, l[ s obtained by performing N 
independent and uniformly distributed trials. 

The first important question arising when implementing an approximation 
technique concernes the type of convergence and the speed of convergence. 
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For the crude Monte Carlo method, the strong law of large numbers guar¬ 
antees that the numerical integration procedure converges almost surely. 
Moreover, it follows from the central limit theorem that the integration error 
is 


1 N 

n— 1 



o[n - 1/2 ) . 


It is important to note that this order of magnitude does not depend on the 
dimension s, as it happens for classical numerical integration where the er¬ 
ror bound is of order O (lV~ 2 / s ). So the Monte Carlo method for numerical 
integration allows us to overcome the curse of dimensionality. 

It is evident that for the practical implementation of the Monte Carlo 
method the fundamental question is how to produce a random sample. Even 
if there are some techniques such as tables of “random” numbers or physical 
devices for generating random numbers such as white noise, there are some 
deficiencies in the Monte Carlo method minating is usefulness. As remarked 
in [66], the main deficiencies of the Monte Carlo method are: 


• there are only probabilistic error bounds, 


• the regularity of the integrand is not reflected, 


• generating random samples is difficult. 


To cope with these three problems one should select the sample points ac¬ 
cording to a deterministic scheme that is well suited for the problem at hand. 
The quasi-Monte Carlo method is based on this idea. 

Recall that Weyl’s Theorem 1.1.44 suggests the possibility of a numerical 
approximation of the integral of a continuous function / defined on [0, l] s 
by the average function value at N u.d. points in [0,1 [ s , i.e. 


1 N 

I™ N £ /(X "> = 

n= 1 


wi 


/(x)dx , 


where xi,... , xjv are u.d. in [0, l[ s . 

Therefore, it is very interesting to get information on the order of this con¬ 
vergence. 

Referring to this problem, a very useful estimate is provided by the Koksma- 
Hlawka inequality which is strictly related to the discrepancy of the se¬ 
quence. 

Before we can present this result, we need to define the variation of a func¬ 
tion / : [0, l] s —> M. 

By a partition 7r of [0, l] s we mean a set of s finite sequences (r/,-°' ) ,..., rj\ m ^) 
for i = 1,..., s with 0 = < r/ ? - 1 ^ < ■ ■ ■ < = 1. In connection with 
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such a partition we define for each i = 1,..., s an operator A * by 

••• ,x s ) = f(x i,... ,Xi-i,r]l j+1 \x i+ i, ...,x s ) 
~ f(xi, ■ • .,Xi-i,ri^\xi + i, ...,x s ) 

for 0 < j < mi. Operators with different indices obviously commute and 
j fc stands for Aq • • • A U; . Such an operator commutes with summation 
over variables on which it does not act. 

Definition 1.1.62 (Function of bounded variation in the sense of Vitali). 
For a function f : [0, l] s —>• M we set 

mi —1 m s — 1 

K (s) (/) = sup ••• , 

p ji=0 js= o 

where the supremum is extended over all partitions P of [0, l] s . 

IfV^(f) is finite then f is said to be of bounded variation on [0, l] s in the 
sense of Vitali. 

Definition 1.1.63 (Function of bounded variation in the sense of Hardy 
and Krause). Let f : [0, l] s —> M and assume that f is of bounded variation 
in the sense of Vitali. If the restriction f l ' F> of f to each face F of [0, l] s of 
dimension 1, 2,..., s — 1 is of bounded variation on F in the sense of Vitali, 
then f is said to be of bounded variation on [0, l] s in the sense of Hardy and 
Krause. 

So we can state the following theorem. 

Theorem 1.1.64 (Koksma-Hlawka’s Inequality). 

Let f be a function of bounded variation on [0, l] s in the sense of Hardy 
and Krause. Let u = (xi,..., xjv) be a finite set of points in [0, l] s . Let us 
denote by u>i the projection of u> on the (s — l)—dimensional face Fi of [0, l] s 

defined by Fi = {(«i,... ,u s ) G [0, l] s : uq =.= u ^ = 1}. Then we 

have 

1 N „ s— 1 

/(x)dx < '%2'52 D N(vi)V( a ~ l) (f iFl) ), (1-26) 

n= 1 ^[0.1] s t= o Fl 

where the second sum is extended over all (s — l)—dimensional faces Fi of 
the form uq = • • • = iq, = 1. The discrepancy D* N (coi) is clearly computed 
on the face of [0, l] s in which ui is contained. 

Hence, the Koksma-Hlawka’s inequality assures that considering a low 
discrepancy s-dimensional sequence in the Quasi-Monte Carlo integration 
leads to an improvement on the Monte Carlo error bound. 

However, in high dimensions the quasi-Monte Carlo method starts losing its 
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effectiveness over the Monte Carlo method. 

Since the error bound in the Monte Carlo approximation does not depend 
on the dimension, several authors found worthwhile to combine the advan¬ 
tages of Monte Carlo and Quasi-Monte Carlo methods, i.e., statistical error 
estimation and faster convergence. As we have already pointed out in the 
section about the convergence rate of the Halton sequence the basic idea is 
to consider the so-called hybrid sequences. 

For a mixed s-dimensional sequence, whose elements are vectors obtained 
by concatenating d-dimensional vectors from a low-discrepancy sequence 
with (s — d)-dimensional random vectors, probabilistic upper bounds for its 
star discrepancy have been provided by several authors, e.g. H1EQ1E21ESI. 
The first deterministic bounds have been shown by Niederreiter m and 
since then other results have been found, see e.g. [68, '30]. 

1.2 Ergodic Theory 

In this section we recall basic definitions and results on the theory of dynam¬ 
ical systems. Our main references for this section will be [2511271130117BHT02]. 
The structure of the section is the following: a first part deals with measure- 
theoretic aspects of the theory, namely measure-preserving transformations, 
property of mixing and ergodicity and examples. 

A second part is concerned with topological aspects, such as invariant mea¬ 
sures for continuous transformations, with a particular attention to dynam¬ 
ical systems associated to a numeration system. 


1.2.1 Preliminary definitions and results 

Before giving formal definitions of the mathematical objects in ergodic the¬ 
ory, it is worthwhile to understand what is ergodic theory about. Roughly 
speaking, it is a part of the theory of dynamical systems. In its simplest 
form, a dynamical system is a function T defined on a set X. The aim 
of the theory is to describe the asymptotic behavior of the iterates of this 
map in a certain point x E X, defined by induction in the following way: 
x = T°(x),T 1 (x) = T(x),T 2 (x) = T(T(x)),... ,T n = T r *- 1 (r(x)). 

This sequence is called orbit of x under T. 

According to the different structure which X and T may have, the theory 
of dynamical systems splits into subfields: 

• Differentiable dynamics deals with actions by differentiable maps on 
smooth manifolds; 

• Ergodic theory deals with measure preserving actions of measurable 
maps on a measure space, usually assumed to be finite; 
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• Topological dynamics deals with actions of continuous maps on topo¬ 
logical spaces, usually compact metric spaces. 

Our dissertation will focus only on the last two aspects of the theory. 

Definition 1.2.1. A measure preserving transformation is a transformation 
T defined on a measure space (X,A,n), such that 

1. T is measurable: E £ A => T~ 1 E 6 A; 

2. n is T-invariant: p(T~ 1 E) = p(E) for all E e A. 

A classic problem in ergodic theory is to find a suitable measure on X 
which is preserved by T. 

We now provide some examples of transformations and the corresponding 
preserved measures. 

Rotations on a circle Let X = [0,1[ , let B be the Borel a-algebra 
and A be the Lebesgue measure on X. Fix a G R. Define T a : X —> X 
by T(x) = x + a mod 1. T a is called a circle rotation, because the map 
R(x) = e 2mx is an isomorphism between T a and the rotation by the angle 
27ra on the unit circle S' 1 . 

In fact, it is well-known that an alternative way to describe the circle S 1 , that 
is often more convenient, is to cut and open the circle to obtain an interval. 
Let 1/ ~ denote the unit interval [0,1] with the endpoints identified: the 
symbol ~ recalls that 0 and 1 are glued together. Then // ~ is equivalent 
to a circle. 

More formally, consider R/Z, i.e. the space whose points are equivalence 
classes x + Z of real numbers x up to integers: two reals x\,X 2 € R are in the 
same equivalence class if and only if there exists fceZ such that x\ = X 2 + k. 
Then R/Z = // ~ since [0,1] contains exactly one representative for each 
equivalence class with the only exception of 0 and 1, which belong to the 
same equivalence class, but are identifyed. 

All these spaces can be considered to describe a rotation on a circle. 

Lemma 1.2.2. T a is measure preserving on X with respect to the Lebesgue 
measure X. 

Doubling map Consider now X = [0,1] equipped with the Lebesgue 
measure A, and define T 2 : X —> X by T-Ax) = 2x mod 1. T 2 is is called the 
doubling map. It is an easy exercise to prove the following 

Lemma 1.2.3. X 2 preserves the Lebesgue measure A on [0,1]. 

Remark 1.2.4. The dynamical system associated to this map yields the 
binary expansion of points in [0,1 [ in the following way. We define the 



Preliminary topics 


27 


ai(x) = 


function aq by 

'0, 0 < x < 1/2 

v l, 1/2 < x < 1 , 

f/ien Tox = 2x — a±(x). Now, for n > 1 set a n (x) = ai(T^ _1 x). Fix 
rewriting we get x = uFere T 2 X = Continuing in 

this manner, we see that for each n > 1, 


ai(x) a 2 (x) 
x = —--1--b 


2 2 


+ a n (x) + T 2 x 


Since 0 < T£x < 1, we get 


x — 


y' ai{x) 

2.^ 2 * 

2=1 


r 2 -x 


as n 


CX). 


77m S; x = E”=i 

We now present an example of a transformation which does not preserve 
the Lebesgue measure A. 


/^-transformations Let X = [0,1 [ and fi > 1 be real. Define the trans¬ 
formation Tp : X —> X by 


Tpx = fix mod 1 = 



if 0 < x < jj 

if ^ < x < 1 . 


(1.27) 


Figure 1.2 illustrates the most well-known /3-transformation, obtained with 

P= 

This family of transformations got a considerable interest since its intro¬ 
duction by Renyi [75] in 1957. He showed that for every /3 > 1 there exists 
a unique normalized measure up, equivalent to the Lebesgue measure and 
invariant under Tp, such that for every element E in the Borel cr-algebra of 
[0,1[ 

/ap(E) = / hp(x)dx , 

Je 

whose density hp is a measurable function satisfying 

1 -1 < W < Dr ■ 


Few years later, Gelfond [38] and Parry m independently found the fol¬ 
lowing explicit form for the density hp defining the measure up, where for 
convenience one defines T^(x) = x and inductively Tjf( 1) = Tjf~ 1 ({/3}): 
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Figure 1.2: Preimage of the interval [a, b[ under the /3-transformation with 
/3 = 4±± 


where C(/3 ) is the normalizing constant defined by 



Remark 1.2.5. By iterating Tp, one can show that every x € X has a series 
expansion of the form 


x = J2 

i=i 


di(x ) 

F 


where the digits djfx) are all elements of the set {0,1,... 3 |_/3 — 1J}. In fact, 
if we define the function d\ by 


di(x) 


0, 0 < x < 1/(3 

1, 1//3 <x <1 , 


then Tpx = /3x — d\{x). For n > 1, set d n {x) = di(Tjf 1 x). Fix x £ X, 

rewriting we get x = where Tpx = 4 —Continuing in 

this manner, we see that for each n> 1, 

_ d\(x) d 2 (x) d n (x) Tjfx 

/3 + /3 2 + " ' + pn. + pn ■ 

Since 0 < Tjfx < 1, we get 




di(x) 

F 


TJfx 

{3 n 


as n 


oo. 
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Thus, x = ElLi 

Renyi UM was the first to define the representation of real numbers in 
base (3, generalising expansions in integer bases. 

New generalizations, such as {a, ft) -expansions f 26] and {—fi)- expansion 
have been recently introduced. We will talk more about fi-expansions of real 
numbers in the next subsection in relation to low-discrepancy sequences. 

Remark 1.2.6. A reasonable question about the density defining the Tp- 
invariant measure concerns the sum involved in the definition. More pre¬ 
cisely one can ask under which conditions the sum is finite. The answer to 
this question has been given by Parry and is related to the ft-expansion 
of P, i.e. if fi has a recurrent tail in its P-expansion, then hp is a step 
function with a finite number of steps. 

Definition 1.2.7. Let T : X —>• X be a measure preserving transformation 
on a probability space (X,A,n). The map T is said to be ergodic if for every 
A E A satisfying T -1 (y4) = A, we have n(A) = 0 or /jl{A) = 1. 

We will use later an equivalent property: T is ergodic if and only if 
for any measurable set B with p,(B) > 0, the set B 7 = Uit^oc T l (B) has 
measure 1. 

A useful characterization of ergodic transformations is given by the following 
lemma. 

Lemma 1.2.8. Let (X,A,/a) be a probability space and let T : X —* X be 
a measure-preserving transformation. Then T is ergodic if and only if the 
only T-invariant measurable functions f, i.e. satisfying f o T = f (up to 
sets of measure 0 ), are constant almost everywhere. 

For the straightforward proof, we notice that if the condition in the 
lemma holds and A is an invariant set, then 1a°T = 1a almost everywhere, 
so that 1 a is an a.e. constant function and so A or X \ A is of measure 0. 
Conversely, if / is an invariant function, we see that {x : f(x) < a} is an 
invariant set for each a and hence of measure 0 or 1. It follows that / is 
constant almost everywhere. 

A basic problem in ergodic theory is to determine whether two measure 
preserving transformations are measure theoretically isomorphic. 

To answer to this question it seems natural to introduce the unitary operator 
Ut associated to T on LjfiX). 

Given a a measure-preserving map T on a function space the associated 
induced operator Ut ■ L 2 ^ —> L 2 ^ is defined by 


U T (f) = f oT . 
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Since L 2 ^ is a Hilbert space, then for any two functions fi, f 2 G L 2 

{U T f \, Urf-i) = j fx o T ■ J^Tdg 

= J f\ .f-id/i 

= (/i,/ 2 ). 

where the second equality holds since g is T-invariant. 

Thus Ut is an isometry mapping L 2 into L 2 whenever ( X , B, \i, T ) is a 
measure-preserving system. 

If U : Hi —> V .2 is a continuous linear operator between Hilbert spaces, then 
the relation 

(Uf,g) = (f,U*g) 

defines an associated operator U* : % —> T~Li called the adjoint of U. The 
operator U is an isometry, i.e. ||£/7 i||% 2 = ||/i|| for all h G Hi, if and only if 


U*U = Id Hl , 

where Id-^ 1 is the identity operator on Hi and 

UU* = 7T lmU , 


where tt\ m u is the projection operator onto Im U. 

Finally, an invertible linear operator U is called unitary if t/ -1 = U, or 
equivalently if U is invertible and 

\{Uhi,Uh 2 )\ = \(hi,h 2 )\ 


for all hi,h 2 G Hi- If U : Hi H 2 satisfies this last relation then U is an 
isometry (even if it is not invertible). 

Thus for any measure-preserving transformation T, the associated operator 
Ut is an isometry, and if T is invertible then the associated operator Ut is 
a unitary operator, called the associated unitary operator of T or Koopman 
operator of T. 

Properties of T that are preserved under unitary equivalence of the as¬ 
sociated unitary operators are called spectral properties. Tf and T 2 are 
called spectrally isomorphic if their associated unitary operators are unitar- 
ily equivalent. 

The following lemma shows that ergodicity is a spectral property and it is 


a direct corollary of Lemma 1.2.8 


Lemma 1.2.9. A measure-preserving transformation T is ergodic if and 
only if 1 is a simple eigenvalue of the associated operator Ut- 
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For a concise treatment of the spectral theory relevant to ergodic theory, 
we refer to Parry’s book m- 

All examples of measure preserving transformations considered above are 
also examples of ergodic transformations. In particular, it is an easy appli¬ 


cation of Lemma 1.2.8 to show that T a : X —> X defined by T(x) = x + a 


mod 1 and T 2 : X —> X defined by ) = 2x mod 1 are ergodic. 

The proof of the ergodicity of Tp is a bit more involved. A proof can be 
found in Renyi [78j and it makes use of the following 

Lemma 1.2.10 (Knopp’s Lemma). Let B be a Lebesgue set and C be a class 
of subintervals of [ 0 , 1 [ such that 

1. every open subinterval of [ 0 , 1 [ is at most a countable union of disjoint 
elements from C 

2. VA E C A (A n B) > qA(A) with 7 > 0 independent of A. 

Then A (B) = 1. 

In general, checking that a given measure-preserving transformation is 
ergodic is a non-trivial task. We will discuss other stronger properties that 
a measure-preserving transformation may enjoy, and that in some cases are 
easier to check. 

If T is ergodic with respect to fi, then the following result due to Birkhoff 
[ 12 ] holds. 

Theorem 1.2.11 (Birkhoff’s Theorem). Let (X, A, fx,T) be a measure the¬ 
oretical dynamical system. Then, for every f E L* (A), the limit 


AT— 1 



n =0 


exists for p-almost every x E X (here T°x = x). If T : X —>• X is ergodic, 
then for every f E T l (X) we have 


AT— 1 



(1.29) 


for /a-almost every x E X. 

In particular, an immediate consequence of the previous theorem is that 
the orbit ( T n x) n of x under T is a u.d. sequence for almost every lEl 
whenever T is ergodic. 


The following lemma follows easily from Birkhoff’s Theorem. 
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Lemma 1.2.12. Let T be a measure-preserving transformation on the prob¬ 
ability space (X,A, n). Then T is ergodic if and only if for all A, B G A we 
have 

N—l 

^lo N S T ~ nA n B ) = T(A)v(B) . (1.30) 

n=0 

Recall from abstract probability theory that two events A, B are inde¬ 
pendent if fa(A n B) = fi(A)fi(B). Also recall that a sequence c n is said to 
Cesaro converge to a if 


N—l 


lim — 

N —»oo N ' 


Cm — Cl 


3=0 


Thus T is ergodic if and only if the Cesaro averages of the sequence p(T~ n AC\ 
B ) converge to p,{A)n{B). That is, given two sets A,B e A, the sets 
T~ n A , B approach independence as n tends to infinity in some appropriate 
sense. 


1.2.2 Classical constructions of dynamical systems 

In this section we discuss several standard methods for creating new mea¬ 
sure preserving transformations from old ones. These constructions appear 
quite frequently in applications. 

Before giving explicit examples of constructions of dynamical systems, we 
provid some useful relations between measure-preserving transformations. 
For instance, one interesting notion is that of isomorphism for measure¬ 
preserving transformations, i.e. when two measure-preserving transforma¬ 
tions can be considered to be the same or equivalent. 

Definition 1.2.13. Let (A*, At, for (i = 1,2) be two measure-preser¬ 

ving systems over a probability space. We say that T\ is isomorphic to T 2 if 
there exist A £ A\ and Belj with Hi(A) = l, /. 12 (B) = 1 such that 

1. T X A c A, T 2 B C B 

2. there exists an invertible measure-preserving transformation 

<j>:A->B 


with 

<t>Ti(x) = T 2 (0(s)) 

for all x e A, where A and B are assumed to be equipped with the 
a-algebras AnA\ and BnA 2 , respectively, and the restrictions of the 
measures m to these a-algebras. 
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In this case we write T\ ~ Ti- 

We have already seen that the map fi(x) = e 2mx is an isomorphism be¬ 
tween T a and the rotation by the angle 27 tq; on the unit circle S' 1 and that 
the map <f)(x) = e 2mx is an isomorphism between the doubling map T -2 and 
the map e l6 —> e 2ld on S' 1 . 

This dehnition will be particularly useful in Chapter 4 when we will con¬ 
struct new ergodic systems with a prescribed property via an isomorphism 
with a system that has the desired property. In particular, we will refer to 
the following example that can be found in [ 321 Example 2.4], 

Example 1.2.14. Let Lb be the compact group of b-adic integers and r : 
Zb —> Zb the addition-by-one map (called odometer). Our goal is to find 
an isomorphism fib between r and a transformation T on [0,1[, such that 
T = fi b o t o fi- 1 . 

For an integer b > 2, every z E Zb has a unique expansion of the form 

z = Yl z ^ 

j> 0 

with digits Zj E {0,1,... , b — 1}. For z E Zb we define the b-adic Monna 
map fi b : 'Lb —> [0,1[ by 

fib ) ='52 z i b ~ 3 ~ 1 - 

\j >o / j >o 


The restriction of fib to No is the radical-inverse function in base b (see Def¬ 
inition 1.1.19) that gives rise to the van der Corput sequence in base b. 


The Monna map is continuous and surjective but not injective. In order 
to make it an isomorphism we only consider the so-called regular representa¬ 
tions, i.e. representations with infinitely many digits Zj different from 6—1. 
The Monna map restricted to these regular representations admits an inverse 
(called pseudo-inverse) fif : [0,1) —> Lb, defined by 


^b 1 ( J2 z i b 3 1 ) = J2 z 3 lyl > 

\ j >0 / 3 >0 

where [C ? >o z jb ^~ X is a b-adic rational in [0,1). 

Moreover fib is measure preserving from Lb onto [0,1[ and T = fib ° r o 
fifi 1 : [0,1[— > [0,1[. Hence fib and T are isomorphic. 
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Remark 1.2.15. For an isomorphism between two measure-preserving trans¬ 
formations, the following properties hold. 

1. Isomorphism is an equivalence relation; 

2. If T\ ~T- 2 , then Tf ~ TJf for every n > 0 . 

Isomorphism is actually a relation which is in many cases too strong. A 
weaker and useful condition is conjugacy. 

Definition 1.2.16. Let ( X,A,/i ) be a probability space. Define an equiva¬ 
lence relation on A by saying that A and B are equivalent (A ~ B) if and 
only if ii(A A B) = 0. Let A denote the collection of equivalence classes. 
Then A is a Boolean a-algebra under the operations of complementation, 
union and intersection inherited from A. The measure fi induces a measure 
fi on A by fi(B) = p(B). The pair (A,ff) is called a measure algebra. 

A map $ : (A- 2 , F 2 ) —> (All, /Ji) is called an isomorphism of measure-algebras 
if it is a bijection that preserves complements, countable unions and satisfies 
/li(&(B)) = fi 2 (B) for every B £ A 2 . 

Let (Xi, Ai, m) for (i = 1,2) be two probability spaces with correspond¬ 
ing measure algebras (Ai,fli). If f> : X\ —> X 2 is measure-preserving, then 

we have a map c/> _1 : (_4.2,M 2 ) —t defined by <f>~ 1 (B) = (j)~ l (B). 

This map is well-defined since f> is measure-preserving. The map pre¬ 
serves complements and countable unions (and hence countable intersec¬ 
tions). Also fLi(<f>~ 1 (B)) = fi 2 (B) for every B £ A 2 . Therefore <fi can be 
considered a homomorphism of measure algebras. Note that f)~ l is injective. 

Definition 1.2.17. Let T,; be a measure-preserving transformation on the 
probability space (Xi,Ai,m), i = 1,2. We say that T\ is conjugate to X 2 
if there exists a measure-algebra isomorphism 4> : (A 2 , F 2 ) -4 Mi,/A) such 

that <s>f 2 ~ l = fr 1 ^. 

Conjugacy is also an equivalence relation and all isomorphic measure¬ 
preserving transformations are conjugate, as stated by the following 

Theorem 1.2.18. Let T\ be a measure-preserving transformation on the 
probability space (Xj, Ai, Pi), i = 1,2. If T\ is isomorphic to T 2 , then T\ is 
conjugate to T 2 . 

In some cases, conjugacy can also imply isomorphism. 

We have already seen that another way to compare two measure-preser¬ 
ving transformations is to consider the associated unitary operator. We re¬ 
call briefly that T\ and X 2 are called spectrally isomorphic if their associated 
unitary operators are unitarily equivalent. 

We briefly remind what is an eigenvalue of a measure-preserving transfor¬ 
mation. 
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Definition 1.2.19. Let T be a measure-preserving transformation T on a 
probability space and let Ut be the induced linear isometry of L 2 . 

The eigenvalues and eigenfunctions of Ut are called the eigenvalues and 
eigenfunctions of T. So a complex number A is called an eigenvalue of T if 
there exists a non-zero function f 6 L 2 ^, satisfying Urf = A/. The function 
f is called an eigenfunction of T corresponding to the eigenvalue A. 

Definition 1.2.20. An ergodic measure-preserving transformation T on a 
probability space (X, A, p) is said to have discrete spectrum (or pure point 
spectrum) if there exists an orthonormal basis for L 2 l consisting of eigen¬ 
functions of T. 

The following result shows that spectral isomorphism is weaker than 
conjugacy. 

Theorem 1.2.21. Let T{ be a measure-preserving transformation on the 
probability space (Xi, Ai, Hi), i = 1,2. IfT\ and T 2 are conjugate, then they 
are spectrally isomorphic. 

There are instances when spectral isomorphism implies conjugacy. 

Then, summarising, we have the following definition. 

Definition 1.2.22. A property P of a measure-preserving transformation 
is an isomorphism, or conjugacy or spectral invariant if the following holds: 
Given T\ has P and T 2 is isomorphic, or conjugate or spectrally isomorphic, 
to Tj then T 2 has property P. 

Now, since isomorphism implies conjugacy and conjugacy implies spec¬ 
tral isomorphism, a spectral invariant is a conjugacy invariant and a conju¬ 
gacy invariant is an isomorphism invariant. 

As we have already pointed out, any two measure-preserving transfor¬ 
mations that are conjugate are also spectrally isomorphic. Now, if two spec¬ 
trally isomorphic measure-preserving transformations have discrete spec¬ 
trum, then they are conjugate. Thus, the property of discrete spectrum is 
very important and depends upon the eigenvalues of the measure-preserving 
transformation. Note the if two measure-preserving transformations are 
spectrally isomorphic then they have the same eigenvalues. 

The following theorem proved by Halmos and von Neumann in 1942 
shows that the eigenvalues determine completely whether two transforma¬ 
tions with discrete spectrum are conjugate or not. 

Theorem 1.2.23 (Discrete Spectrum Theorem). Let T\ and T 2 be ergodic 
measure-preserving transformations with discrete spectrum of the probability 
spaces (Xi, Ai, fii) for (i = 1,2). Then the following are equivalent: 
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1. T\ and X 2 are spectrally isomorphic; 

2. T\ and T 2 have the same eigenvalues; 

3. T\ and X 2 are conjugate. 

Corollary 1.2.24. If T is an invertible ergodic measure-preserving trans¬ 
formation with discrete spectrum, then T and T _1 are conjugate. 

Remark 1.2.25. If the spaces pf) for (i = 1,2) are both complete 

separable spaces, then the statements of Theorem \1.2.23\ are equivalent to T\ 
being isomorphic to Ti- 

Let us now turn our attention to a collection of ergodic measure-preserving 
transformations which have discrete spectrum (see |102( §3.3] for details). 

Example 1.2.26. Let S ' 1 be the complex unit circle and suppose T : S 1 —» S 1 
is defined by T(z) = az where a is not a root of unity. We know that T 
is ergodic and is a rotation of a compact group. Consider the sequence of 
functions f n : S 1 —> C defined by f n (z ) = z n . Then f n is an eigenfunction of 
T corresponding to the eigenvalue a n . Since (f n ) forms a basis for L^S 1 ), 
we see that T has discrete spectrum. 

The following two theorems completely solve the conjugacy problem for 
ergodic rotations with discrete spectrum. 

They can be found in [ 102 1 , Theorem 3.6, Theorem 3.7] and we provide a 
proof for the second one since it does not appear in the reference and it 
seemed to be of some interest for the reader. 

Theorem 1.2.27 (Representation Theorem). An ergodic measure-preserving 
transformation T with discrete spectrum on a probability space (X , A, p) is 
conjugate to an ergodic rotation on some compact abelian group. The group 
is metrisable if and only if (X, A, p) has a countable basis. 

Theorem 1.2.28 (Existence Theorem). Every subgroup K of S l is the 
group of eigenvalues of an ergodic measure-preserving transformation with 
discrete spectrum. 

Proof. Let I\ be a subgroup of S l and consider the following family indexed 

by K _ 

Gt = {nt\n 6 Z} t 6 K . 

Then consider the infinite product 

c=ne,. 

teK 

This is a compact group and the Haar measure p is defined on it. The 
following functions gt : G ^ C defined by 

9t((x s ) seK ) = e 2wixt 
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are characters of G. 

We want to show that there exists an ergodic measure-preserving transfor¬ 
mation S with pure discrete spectrum K. such that these characters are the 
eigenfunctions of S associated to the eigenvalues in K. 

Define S : G —> G to be the map ( xt)te.K ^ ( x t + t)teK- Then it follows 
that 


g t (S(x s ) se K ) = 9 t((x s + s ) s£K ) = e 2m{ - Xt+t) 

= e 2mt g t ((x s ) seK ) ■ 

Hence the characters gt are eigenfunctions of S. Moreover, since G is com¬ 
pact, the characters gt form an orthonormal basis for L 2 (g). Finally, S is 
ergodic since for every non-empty open subset U of G we have Uff = _ oc S n U = 
G. □ 

Products 

The product of two measure spaces (Xt, At, gt) for (i = 1, 2) is the measure 
space (Xi XX 2 , Ai< 8 >A 2 , gi x 92 ) where A 1 XA 2 is the smallest a algebra which 
contains all set of the form A\ x A 2 where A. L G At , and g \ x g 2 is the unique 
measure such that (g\ x g 2 )(A\ x A 2 ) = gi{A{)g 2 {A 2 ). This construction 
captures the idea of independence from probability theory: if (X,, A t , gt) are 
the probability models of two random experiments, and these experiments 
are “independent”, then (A'i x X 2 , A\ ®A 2 ,gi x ^ 2 ) is the probability model 
of the pair of experiments. 

Definition 1.2.29. The product of two measure preserving systems (Xi, At, 
gt, Tf) for (i = 1, 2) is the measure preserving system (X\ x X 2 , Mi(DM 2 , gi x 
92 -Ti x T 2 ), where (Ti x T 2 )(x 1 ,x 2 ) = (T 1 x 1 ,T 2 x 2 ). 

We now show that the product of two ergodic measure preserving trans¬ 
formations is not always ergodic. 

Theorem 1.2.30. Let (Xi, Ai, gi,Tf) for (i = 1,2) be two ergodic systems. 
Then T\ x T 2 is ergodic if and only if Ut, and Ut 2 have no common eigen¬ 
values other than 1 . 

Induced transformations 

A central problem in ergodic theory is that of recurrence, concerning how 
points in measurable dynamical systems return close to themselves under 
iteration. The first and most important result is due to Poincare [77] in 1890 
who proved it in the context of a natural invariant measure in the three- 
body problem of planetary orbits, before the creation of abstract measure 
theory. Poincare recurrence is the pigeon-hole principle for ergodic theory; 
indeed on a finite measure space it is exactly the pigeon-hole principle. 
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Theorem 1.2.31. Let T : X —> X be a measure-preserving transformation 
on a probability space (X,A,p), and let E C X be a measurable set. Then 
almost every point x £ E returns to E infinitely often. That is, there exists 
a measurable set F C E with p(F) = p(E) with the property that for every 
x £ F there exist positive integers n\ < n 2 < ... with T n ' 1 x £ E for all 
i > 1. 

In the following example we show that the Poincare recurrence does not 
necessarily hold if the measure space is not of finite measure. 

Example 1.2.32. The map T : M —> M defined by T(x) = x + 1 preserves 
the Lebesgue measure A on M. For any bounded set E CK and any x £ E, 
the set 

{n > 1 : T n x £ E} 

is finite. Thus the map T exhibits no recurrence. 

Now let A be a measurable set with p(A) > 0. By the Poincare recur¬ 
rence, the first return time to A. defined by 

ua(x) = inf{n : T n (x) £ A} 

n> 1 

exists, i.e. is hnite, almost everywhere. 

Definition 1.2.33. The map Ta : A -A A defined (almost everywhere) by 

T a (x) = T nA ^\x) 

is called the transformation induced by T on the set A. 

Observe that both n A : X —> N and T A '■ A —$■ A are measurable, since 
for every n > 1, we can write A n = {x £ A \ n a (x) = n}. Then the sets 

Ai = A fl T _1 A , 

A 2 = AnT~ 2 A\A 1: 

A n = AnT~ n A\\jAi 

i<n 

are all measurable, as it is 

T n A n = A n T n A \ (TA U T 2 A U • • • U T n ~ l A) , 
since T is invertible by assumption. 

Proposition 1.2.34. The induced transformation T A is a measure-preser¬ 
ving transformation on the space (A, 13(A), /i A , T 4 ), where 13(A) = {E 0 A : 
E £ B}, [i A is the measure ha(E) = /a(E\A) = p(E 0 A)/fi(A). If T is 
ergodic with respect to p, then T A is ergodic with respect to p A . 
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T 2 (A) \(j4u T(A)) 


H-h 


T(A) \ A 



T 


Ao 


A 


Figure 1.3: The induced transformation Ta 


As pointed out in m, the effect of Ta can be seen in the situation 
described by Figure [131 called the Kakutani skyscraper. 

The original transformation T sends any point with a floor above it to 
the point immediately above on the next floor, and any point on a top floor 
is moved somewhere to the base floor A. The induced transformation Ta 
is the map defined almost everywhere on the bottom floor by sending each 
point to the point obtained by going through all the floors above it and 
returning to A. 

The Poincare recurrence says that for any measure-preserving system 
(. X, B, i-i, T) and any set A of positive measure, almost every point on the 
ground floor of the associated Kakutani skyscraper returns to the ground 
floor at some point. Ergodicity strengthens this statement to say that al¬ 
most every point of the entire space X lies on some floor of the skyscraper. 
However, the Poincare recurrence does not tell us how long we should have 
to wait for this to happen. One would expect that return times to sets of 
large measure are small, and that return times to sets of small measure are 
large. This is indeed the case, and forms the content of Kac’s Lemma |52j 

Theorem 1.2.35 (Kac’s Lemma). Let (X, £>, n,T) be an ergodic measure¬ 
preserving probability system and let A 6 B have strictly positive measure. 
Then the expected return time to A is j^aS’ equivalently 

/ nAdp, = 1 . 

J A 

As it will appear clear in Chapter 3, Kakutani skyscrapers are a powerful 
tool in ergodic theory. From the Kakutani skycraper construction we can 
deduce a very useful lemma of Kakutani [53] and Rokhlin m often called 
Rokhlin’s lemma. 
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Lemma 1.2.36. Let T : X —> X be an ergodic measure preserving transfor¬ 
mation on a non-atomic probability space ( X , B, p). Then for any n > 1 and 
e > 0 there exists a measurable set B C X such that B,TB,..., T n ~ 1 B are 
pairwise disjoint and p (Li r f~QT' l (B)') > 1 — e. The collection {B,TB,..., 
T n ~ 1 B} is referred to as a Rokhlin tower of height n for the transformation 
T. 


Interval exchange 

The class of interval exchange transformations was introduced by Sinai |89j. 
An interval exchange transformation is the map obtained by cutting the 
interval into a finite number of pieces and permuting them in such a way 
that the resulting map is invertible, and restricted to each interval is an 
order-preserving isometry. More formally, we have the following definition. 

Definition 1.2.37. Let d > 2 be a natural number and let ir be an irreducible 
permutation of {1,... ,d}, that is, 7r{l ,...,&} / {1 ,...,k}, for any 1 < 
k < d. Moreover, let be the set of vectors (Ai,..., A^) in M. d such that 
0 < Xi < 1 for all i and J 2 i=i Aj = 1 . 

An interval exchange on [0,1 [ is a map Ty n : [0,1[ —> [0,1 [ such that it 
is the piecewise translation defined by partitioning the interval [0, 1 [ into d 
sub-intervals of lengths Ai, A 2 ,..., A r and rearranging them according to the 
permutation it; formally 

T\,-jr (x) X T ^ ^ A 7 j-(y) ^ ' A j , 

j<i j<i 

when x is in the interval 


h = 


j<i j<i 


It is easy to see that an interval exchange is a map of [0,1[ into itself 
which is one-to-one, preserves the Lebesgue measure A and is continuous A- 
almost everywhere. Masur m and Veech [ 100j independently showed that 
for almost all values of the sequence of lengths A j, 1 < i < d, the interval 
exchange transformation is ergodic. In fact they proved unique ergodicity, 
which we will discuss in the last part of this section. 


Cutting-stacking 

The cutting-stacking method is a useful tool to construct interval exchanges. 
Its first fomulation is due to von Neumann and Kakutani and then gener¬ 
alised by Friedman 36 j. 

We refer to the description made in [32] where it is used to build sequences 
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in the unit interval with good discrepancy. 

Before starting with the description of the method, we need to fix the nota¬ 
tion. 

We will call columns and denote them by C = (4,..., 4) (also called tow¬ 
ers) a set of disjoint subintervals Ij = [cj,dj[ of [0,1[ having the same length. 
The length of the interval Ij is called the width of C and denoted by 1(C). 
The interval 4 is called the bottom of C, the interval Ih is called the top 
of C, the union supp(C') = U i = jlj is the support of C and the integer h its 
height. With the column C is associated a translation map 

T c '■ supp(C') \ 4 -4 supp(C) \ h 


defined by 


T c (x) =x + (c j+ 1 - cj) 


if x G Ij , 1 < j < h. We represent a column C = (4,..., 4) by drawing 
each interval Ij+i, 1 < j < d above the interval Ij. 

Consider now a given finite set of columns S = {C ±,..., C s } with disjoint 
supports. We associate to S the map T$ which coincides with Tq 1 for 1 < 
i < s. By extending the above notation, we have supp(S’) = Uf =1 supp(C)) 
is the support of S and w(S) = Ylt=i K^i) is the width of S. In the sequel, 
we usually assume that the columns Ci of S are indexed according to the 
order of their bottoms, the one induced by the natural order of [0,1[. 

A cutting of a column C = (I±,..., Ih) in t columns is the set of columns 
Ci = { 4 , 1 ,..., 4, h} such that U* =1 4,i = h and each map T]c, is th e restric¬ 
tion of Tc on Ci. More generally, a cutting of a set S of columns is obtained 
by collecting all columns resulting by cutting part or all columns from S 
and then producing a new set of columns S' = {C (,..., C ' s ,}. 

Now, a stacking of a column C' = (/{,..., I' h ,) above a column C = (4,..., 
Ih) having same width and disjoint support is by definition the column 
C*C' = (h,...,I h ,I[,...,I' hl ). 

The map Tc*c' extends both Tc and T' c and Tc*c' translates Ih onto I(. 
One can also introduce the empty column () of height 0 to set by definition 
C*() = ()*C = C for any column C. 

A sequence S = (S m )m>o of sets S m of columns is said to be complete if 
supp(S'o) = [0,1 [, lim m _^ 0O w(S m ) = 0 and for each m > 1, S m+ \ is built 
from S m by performing cutting and stacking but a finite number of times. 
By construction 4s m+1 extends 4s m . We denote by top(5 m ) (resp. bot(5 m )) 
the union of top (resp. bottom) intervals of columns in S m . 

Clearly top(5 m +i) C top(5 m ), bot(<S' m+ i) C bot(S’ m ) and the intersections 
top(S) = n m > 0 top(S’ m ), bot(S) = H m >obot(5 m ) are at most countable and 
finite if the numbers of columns in infinitely many S n are bounded. Clearly, 
the map 7g m is not defined on top(S) but it is easy to prove that for a com¬ 
plete sequence S there is a unique transformation T : [0, l[\top(E) —> [0,1 [ 
which extends all the 4s m ’s. 
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Moreover, T is a measure-preserving map of ([0,1[, A), well defined on [0,1[\ 
top(S) and invertible on [0, l[\bot(S). 

The transformations obtained in this manner are called staircase trans¬ 
formations. A transformation created by a cutting and stacking with a single 
column resulting from each iteration is a rank-one transformation. We refer 
to [32] and [35] for more details. Rank-one transformations are measurable 
and measure-preserving under Lebesgue measure. 

We now present the simplest example of rank-one transformation ob¬ 
tained by means of the cutting-stacking technique. 

The von Neumann-Kakutani odometer Let us consider the unit in¬ 
terval as a column and let us denote it by So = {[0,1[}. Let us split So into 
two subintervals, [0,1/2[ and [1/2,1[ and stack them one on the other in or¬ 
der to form the column S\ = {[0,1/2[, [1/2,1[}. The heigh of Si is h(S\) = 2 
and its width is w(Si) = 1/2. We define the map Tj (x) = x + \ as 
the translation of the first subinterval of Si onto the second one. At the 
second step we cut each interval of Si into equal parts, take the right half 
and stack it onto the top of the left half. So we get the column S 2 = 
{[0,1/4[, [1/2, 3/4[, [1/4, l/2[, [3/4,1[}, with h(S 2 ) = 4 and w{S 2 ) = 1/4. To 
this second step we associate the map Tj (x) = xJr \- We can visualize 
the above steps of the cutting-stacking procedure in Figure 2. 

The procedure goes on this way, splitting each interval of the column 
S n —i in half and stacking the right column of intervals onto the left column. 
So we obtain the sequence of columns 

Sn = {[M0), M0) + 2~ n [, [02(1), 02(1) + 2-"[,..., [Mn - 1), 1[} , 

with h(S n ) = 2 n_1 and w(S n ) = It is worthwhile to note that the left 
endpoints of the intervals are exactly 4 > 2 (n), the radical inverse function of 
n in base 2. 

Let us recall that the associated sequence ( <t> 2 {n)) n >o is the already de¬ 
scribed van der Corput sequence. 

Inductively, the transformation T associated to this construction and 
called Kakutani-von Neumann odometer is defined on the countable sequence 
of intervals considered above by 

T k (x) = x - 1 + 2~ k + 2 _fc_1 for x € [1 - 2~ k , 1 - 2~ fc ” 1 [ and k > 0 . 

Figure 3 shows the transformation T on [0, l/2[U[l/2, 3/4[U[3/4, 7/8[. 

Lambert [59] proved that the sequence obtained as the orbit of 0 un¬ 
der the von Neumann-Kakutani transformation T is the van der Corput 
sequence, {(j} 2 {n)) n >o- 
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Figure 1.4: Partial graph of the dyadic odometer 



Figure 1.5: Partial graph of the von Neumann-Kakutani transformation 


The von Neumann-Kakutani odometer is one of the simplest examples of 
a rank one transformation and, consequently, it is ergodic, even if ergodicity 
can be proved directly. We now consider a slightly different variation of the 
cutting-stacking technique according to a substitution a. 

Let A be a non empty set, called alphabet, of s elements, called letters. 
Usually we take A = {1,..., s}. A word w of length \w\ = n on A is an 
ordered string w i... w n of n letters Wj in A. A word of length 0 is called 
empty word and denoted by A. For any letter a, the number of occurrences 
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of a in w is denoted |ir| a . Hence |u;| = ^ agj4 IHa- 

We denote by A* the set of words over the alphabet A, equipped with the 
concatenation law (v\ ... v m )(w\... w n ) = (rq ... v m w\ ... w n ). A* is the 
free monoid generated by A. where the empty word is the neutral element. 


Definition 1.2.38. A monoid endomorphism a : A* —» A* is called a sub¬ 
stitution if |<r(a)| > 1 for all letters a E A. If a (a) = A for at least one 
letter, we say that a is a pseudo-substitution. 


If A = {1,..., s}, then the following matrix M(a) 


M(a) 


/V(i)li 

W)i. 


cr(s)| 

cr(s)| 


is called the companion matrix of a. It will play a fundamental role. Let V s 
be the set of positive column vectors I E I s , i.e. such that all entries U of 
l are positive. For any couple (1,1') of vectors in V s , we say that l 1 derives 
from l by a, and we write la —> l', if the relation 


l = M(a)l' 


holds. 


Now let us consider the cutting-stacking process introduced in [42] j where 
starting from a set of columns S = {C\ , ..., Cg} we can build another set of 
columns S' = {C[,... ,C' S }, according to a. 

To do so, let l be the column vector in V s with entries f = l(Ci) and assume 
that there exists V E V s such that l' derives from l by a. Now cut each 
column Cj in order to create a set of \a\j = J2k=i I a (k)\j (sum of entries of 
the j-th line of M(a )) sub-columns Sj = {Cy i,..., Cj^.} such that \a(k)\j 
of them have width l',. Then, for each k, build the column C by stacking 
Ylk =l I a (k)\j sub-columns such that 

• l cr (^)lj sub-columns come from the sub-columns of width l' k in Sj] 


• from the bottom to the top the column CL is built according to the 
word a(k) = <x M ... 0k,\<r(k)\ 


cl = hf, • 


T (2) * 

O-fc ,2 


T (wm 

a k,\cr(k)\ 


(?) 

where Tf k ’^ is a column from S ak . not used yet. In the standard 

(i) 

construction, we select the successive Ta,' :i in C k from left to right. 
This construction is not unique but at least a standard one exists due 
to l = M(a)l'. When we use a standard construction we say that the 
couple (S', l') derives from (S, l) by taking into account the derivation 

l = M(a)V. 
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At this point we want to define the transformation T a obtained by the itera¬ 
tion of the above derivation process. To make it possible we need to consider 
a particular class of substitutions on A, namely adapted substitutions and 
introduce some definitions. 

Definition 1.2.39. A letter a £ A is said to be expansive for the substitution 
o if the increasing sequence n t-4 |<r n (a)| is unbounded. 

We denote by E(a) the set of all expansive letters for a in A. 

Definition 1.2.40. A substitution a is called adapted if E(a) / 0 and for 
all expansive letters a and all letters x £ A, there exists an integer k > 1 
such that | (J k {a)\ x > 1. 

Definition 1.2.41. Let a be an adapted substitution. The period h of a is 
the period of the companion matrix of ■ Therefore, h is given from any 
expansive letter a by 


h = gcd {k > 1; |cr|(a)| a > 1} . 

Now we can state a useful theorem that can be seen as a generalization of 
the Perron-Frobenius Theorem, already considered at the beginning of this 
section. 


Theorem 1.2.42. Let a be an adapted substitution with companion matrix 
M = M(a). Then 

1. M has an eigenvalue 9 > 1 and 9 > |A| for all eigenvalues A of M, 

2. 9 has an eigenvector with positive entries, 


3. 9 is simple. 


The eigenvalue 9 in the theorem above is called the dominant eigenvalue 
of M (<r) or of cr and the unique eigenvector associated with 9 such that the 
sum of its entries is equal to 1 is called the unitary dominant eigenvector of 
M(a). If a has p expansive letters we may assume that they form the set 
E = {1,... ,p} so that the matrix M(a) takes the form 


M = 


(M{g e ) 

V B 



The dominant eigenvalue 9 of M is also the dominant eigenvalue of M{ge) 
and the first p entries of the dominant eigenvector l of M , after normalisa¬ 
tion is the dominant eigenvector of M(<je )• 


So with this notation at hand we can define the interval exchange T a : 
[0,1[—>• [0,1[ by a cutting-stacking process associated to an adapted substi¬ 
tution a with dominating eigenvalue 9. 
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The positive vector l £ V s will be exactly the unitary dominant eigenvector 
of M(a). 

Then the required transformation will be 

T a : [0, l[\top(£) -» [0,1[ , 

where £ = (S n ) n > 0 . 

We now analyze the example given in |42, §3.7.4], 

Example 1.2.43 (The Fibonacci transformation). The Fibonacci substitu¬ 
tion apib on the alphabet A = {0,1} is defined as 

&Fib{ 0) = 01 CTpibi 1) = 0 . 

Since the iteration of each letter gives rise to an infinite sequence, the sub¬ 
stitution is adapted and the associated companion matrix is 

M(a Fib ) = (\ J) . 

The dominant eigenvalue is 9 = 1+ 2 V ^ , the golden ratio, and the correspond¬ 
ing normalized dominant eigenvector is l = ), where a = 9~ l = 9 — 1 = 

%/ 5—1 
2 • 

Moreover, one can notice that the following relation 

1 = F n+1 a n+1 + F n a n+2 

where (i ? n )n>o is the usual Fibonacci sequence, holds. So, by induction we 
also have that 

a n+i = (i)»(F n a - F n _0 = ||F n a|| . 

Now we want to get a transformation Tpn,, called the Fibonacci transforma¬ 
tion, by means of the cutting-stacking derivations 

( 5 nj iW)^( 5n+1> /(n+l)) . 

According to the general procedure defined above, we have that our first 
set of columns is 

5 0 = {ci 0) ,cf } = {[l-a,l[,[0,l-a[} , 

with cj 0 ^, Cf ] of width a and 1 — a, respectively. 

At this point, applying the cutting-stacking derivation to l = we 

get = (“ 3 ). Then each new column (7{ 0) = C^, will be 

obtained by stacking respectively 


|<FFi&(l)|i + WFib( 2) 11 = 2 
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sub-columns such that w yC 
Therefore 


\&Fib{ 1)|2 + \&Fib{ 2)h = 1 


= l[^ = a 2 and w (C. 


«'•) = 4‘> = 


c f /~i0) p<(i)\ / [0,1 u[ 1 

’ L2 / _ \ [1 — a, 2 — 2a[ [2-2a,l[J ' 

Let us produce one more step before giving the general description of the 
cutting-stacking process. 

Applying again the standard derivation we get = (“ 4 ). Then each new 

(1)' (2) (iV (2) 

column C\ = C\ = C\ w iH be obtained by stacking respectively 

|<7Fib(l)|l + |<7Fi&(2)|l = 2 


\&Fib( 1 ) |2 + \&Fib( 2 ) |2 = 1 

sub-columns such that w ^c{ 2 ^ = l[ 2) = a 3 and w (yO^ ) = 4^ = ° 4 - 
Therefore 

f [2 — 2a, 1[ ] 

5r = (c , j 1) , C'n 1) } = < [2 - 3a, 1 — a[ [0,2-3a[ >. 

^ * \[3-4a,2-2a[ [l-a,3-4a[J 

In general, the standard derivation depends on the parity of n and at every 
step we have that S n is composed of two columns c\" ' > and of height 
respectively F n +i and F n and width w ^C'[ n ^ = a n+1 and w (c^ 1 ^ = a n+2 , 
respectively. 

Since the position of the intervals in the columns depends on the parity of 
n, we have to describe top(C'{ ,! ^) and topC(”^ for n = 2 m and n = 2m + 1. 
If we denote by {x} the fractional part of x, we get that 

top = {{-F 2m a}, 1 [ top = [0,{-F 2 m+ia}[ 


top(cf m+1) ) = [0, {-F 2m+ ia}[ top = [{-F 2m+2 a},l[ • 

The transformation Tpi b obtained in this way is not defined at the point 0. 
Moreover it is not difficult to see that its explicit expression is defined by 
two families of transformations 

T Flb2k : top (Cf k) ) —> bot (Cf k) ) 
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T Fi b 2k {x) = x +a 

and 

T Fib2k+1 : top (tf fc+1) ) —► bot (cf fe+1) ) 

T Fib 2k+ 1 (x) = x - a 2 . 

Extending it by continuity to 0, the graph of Tpib is shown in the following 
picture. As we have already pointed out, this construction is not the only 



Figure 1.6: Fibonacci transformation 

one possible, starting from the Fibonacci substitution, but is the standard 
one in the sense that it is obtained via the standard derivation. 

It is worth considering that this transformation is nothing else but the T a , 


for a = ^ 1 


defined after Definition 


1 . 2.1 


Topological dynamics 

So far, we have been studying a measurable map T defined on a probability 
space We have asked whether the given measure /r is invariant 

or ergodic. Now we focus on the space Xi T (X) of all probability measures 
on the compact metric space X, which are invariant under a continuous 
transformation T : X —> X. 

Any continuous map T : X —» X induces a continuous map 


T* : M(X) -> M(X) 

defined by T(n)(A) = fi(T~ 1 A) for any Borel set A C X. Each point x € X 
defines a measure 5 X (called the Dirac measure) by 


S X (A) 


1 if x £ A 

0 if x £ A . 
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We claim that T{5 X ) — $T(x) for any x 6 X. To see this, let A C X be any 
measurable set, and notice that 

C TS x )(A) = S x (T- 1 A) = S t(x) (A ) . 

This suggests that we should think of the space of measures M(X) as gen¬ 
eralized points, and the transformation T : A4(X) —> A4(X) as a natural 
extension of the map T from the copy {5 x \x 6 X} of X to the larger set 
M(X). For f eC(X) and \x G M(X), 


ix 


fd{T*n)= ( foTd/i 
Jx 


and this property characterizes T. 

The map T is continuous and affine, so the set Xi T (X) of T-invariant mea¬ 
sures is a closed convex subset of Jvl(X). 

Given a continuous mapping T : X —>• X on a compact metric space, a nat¬ 
ural question, is whether invariant measures necessarily exist, i.e. whether 
Ai T (X) 0. The next result and its corollary show that this is indeed the 
case. 


Theorem 1.2.44. LetT : X —> X be a continuous map of a compact metric 
space, and let (zz n )neN be any sequence in M(X). Then any weak*-limit of 
the sequence (^ n )neN defined by 



n— 1 


Y. T * u > 

3 = 0 


is a member of M T (X). 

Corollary 1.2.45 (Krylov-Bogolioubov). Under the hypotheses of the pre¬ 
vious Theorem, M. T (X) is non-empty. 

Thus M T (X) is a non-empty compact convex set, since convex combi¬ 
nations of elements of Xi T (X) belong to Xi T (X). It follows that Xi T (X) 
is an infinite set unless it is a singleton. 

In general, it is difficult to identify measures with specific properties, but 
the ergodic measures are readily characterized in terms of the geometry of 
the space of invariant measures. 

We denote by £(X,T) C Xi T (X) the set of ergodic T-invariant probability 
measures on X. 

The next result will allow us to show that ergodic measures for continuous 
transformations on compact metric spaces always exist. 

Theorem 1.2.46. Let T be a continuous transformation of a compact met¬ 
ric space X equipped with the Borel cr-algebra B. The following are equivalent 
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1. a T-invariant probability measure fj, is ergodic; 

2. fi is an extremal point of M. T (X). 

A natural distinguished class of transformations is formed by those for 
which there is only one invariant Borel measure. This measure is auto¬ 
matically ergodic, and the uniqueness of this measure has several powerful 
consequences. 

Definition 1.2.47. Let X be a compact metric space and letT : X —> X be 
a continuous map. Then T is said to be uniquely ergodic if Xi T (X) = {/a}. 
The dynamical system ( X , A, /a, T) is said to be uniquely ergodic. 

Theorem 1.2.48. For a continuous map T : X —> X on a compact metric 
space, the following properties are equivalent 


1. T is uniquely ergodic. 

2. \S(X,T)\ = 1. 

3. For every continuous function f : X —> X, 


N -1 

lim - Y f(T r 

N—too N 

n =0 


x) = C f 


( 1 . 31 ) 


where Cf is a constant independent of x. 

4- For every continuous function f : X 
uniform across X. 


X, the convergence ( 1 . 31 ) is 


5. The convergence ( 1 . 31 ) holds for every f in a dense subset ofC(X). 


Under any of these assumptions, the constant Cf in ( 1 . 31 ) is f x fd/x, where 
pL is the unique invariant measure. 

An immediate consequence of the previous theorem is that the orbit 
(T n x) ri gN °f x under T is uniformly distributed for every x £ X. 


Let X = S 1 and T a : X —> X be the irrational rotation on the unit 
circle. We have already proved that the Lebesgue measure A is an ergodic T a - 
invariant measure. Furthermore the following result says that the Lebesgue 
measure is the only invariant measure, i.e. T a is uniquely ergodic. 

Lemma 1.2.49. An irrational rotation of a circle T a is uniquely ergodic 
and the only T a -invariant measure is the Lebesgue measure. 
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One important question concerns the unique ergodicity of the cartesian 
product of uniquely ergodic systems, that is under which conditions the 
product of uniquely ergodic systems is uniquely ergodic. As for the product 
of ergodic systems, it turns out that the spectral analysis of the associated 
operators is very useful. More precisely, again we need to require that the 
spectra of the associated operators intersect only at 1, as explained by the 
following 

Theorem 1.2.50. Let % = (Xj,Tj), i = l,...,s, be uniquely ergodic dy¬ 
namical systems. Then the dynamical system T\ x ... x T s is uniquely ergodic 
if and only if for all i, j 6 {1,..., s}, i / j, the discrete parts of the spectra 
of Ti and Tj intersect only at 1. 

1.2.3 Systems of numeration 

We have already considered some examples of numeration systems, such 
as the Ostrowski expansion or the 6-adic and (3 -adic expansions. Roughly 
speaking, a numeration system is a coding of the elements of a set with a 
(finite or infinite) sequence of digits. The result of the coding, the sequence 
of digits, is a representation of the element. 

In this paragraph we give a formal and general definition. For a complete 
survey on numeration systems from a dynamical viewpoint we refer to [9]. 

Definition 1.2.51. A numeration system (resp. a finite numeration sys¬ 
tem) is a triple (X,I,<p), where X is a set, I a finite or countable set, and 
ip an injective map <p : X —>■ I N , defined by 

<p(x) = (e n (x))n> 0 

The map p is the representation map, and ip(x) is the representation of 
xeX. 

Given a numeration system (X,I,<p), one can define an expansion as a 
map if : / N * — > X such that if o <p(x) = x for all x € X. An expansion of an 
element x € X is an equality x = if(y); it is a proper expansion if y = (f(x), 
and an improper expansion otherwise. 

However this definition is still unsatisfactory since for instance it is not 
clear how the digits €{ are produced and how the representation map is 
constructed. 

The dynamical point of view on numeration systems lies precisely in these 
requirements. So we need to find a map whose iteration gives the sequences 
of digits of the representation. In order to define this map in a precise way 
we need to introduce the concept of fibred system introduced by Schweiger 

m- 

Definition 1.2.52. A fibred system is a set X and a transformation T : 
X —> X for which there exist a finite or countable set I and a partition 
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= U e , X, of X such that the restriction T\ of T on X, is injective, for 
any i £ I. This yields a well defined map e : X — y I that associates the 
index i with x £ X such that 

Now it is easy to see that the definition of numeration system given 
above can have a dynamical description by means of a fibred system. The 
new definition of numeration system will be called fibred numeration system. 
Briefly, assume that (X, T ) is a fibred system as above. Let ip : X —> I ] V 
be defined by (p(x) = (e(T”x)) n >o- We will write e n = e o T n_1 for short. 
Let cr be the (right-sided) shift operator on J I! *. Then we get the following 
commutative diagram 


T 

X -» X 






jW _> jW 

' rr 


Formally, we have the following 

Definition 1.2.53. Let (X, T ) be a fibred system and ip : X —> I N * be defined 
by ip(x) = ( e(T n x)) n >o. If the function ip is injective (i.e., if ( X,I,p ) 
is a numeration system), we call the quadruple A f = (X,T,I,p) a fibred 
numeration system. Then I is the set of digits of the numeration system; 
the map p is the representation map and p(x) the M-representation of x. 
In general, the representation map is not surjective. The set of prefixes of 
M-representations is called the language C = C(N) of the fibred numeration 
system, and its elements are said to be admissible. The admissible sequences 
are defined as the elements y £ f v for which y = p(x) for some x £ X. 

The representation map transports cylinders from the product space T v 
to X, and for (io, ii, ■ ■ ■, i n - 1) £ I n , one may define the cylinder 

n —1 

X D C (foi i\ > • • • j in— 1) — f"''| T ^ ) — p [*0, ili ■■■ j in— l] • 

3=0 

Moreover, the assumption that the restriction of T to Xi is injective says 
that the application x *->• ( e(x),T(x )) is itself injective. It is a necessary 
condition for p to be injective, and J\T is a fibred numeration system if and 
only if 

Vx £ X P| C(e i(x),e 2 (x),... ,e n (x)) = {x} 

In the case when X is a metric space, this last condition is satisfied if for 
any admissible sequence (*i, ii ,..., i n ,...), the diameter of the cylinders 
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[ii, ii-, ■ ■ ■ > in] tends to zero when n tends to infinity. In this case, every 
closed subset F of X can be expressed as 

OO 

f= n u c(H,...,i n ) , 

n=l (i 1 ,...,i„)e£ 

which proves that the cr-algebra A generated by the cylinders is the Borel 
algebra. In general, T is „4-nreasurable. 

We can also define a topological structure on the fibred numeration system 
in the following way. 

Definition 1.2.54. For a fibred numeration system Af = ( X , T, I, ip), with a 
Hausdorff topological space I as digit set, the associated Af-compactification 
Xjy is defined as the closure of ip(X) in the product space I"' . 

Now we can see that some of the examples of numeration systems already 
mentioned in the section can be obtained with a particular choice of X, I, T. 

Example 1.2.55 (6-adic representation). Let X = N, / = {0,1,..., b — 1}, 
Xi = i + gN. Then, according to the definition of fibred system we have 

e(n) = n(mod q) . 


Let T : X —>■ X be defined by 


T(n) 



So we have a fibred numeration system with language, set of representations 
and compactification given by 


C b = (J{0,l,...,6-l} n , 

n> 0 


ip(X) = / (H) = {(* 0 , 0,0,...); n G N, ij G {0,1,. .., b - 1 }} , 

= {0,1,...,&-1} N . 

On the same set I of digits we can also consider the space X = [0,1 [ with 
partition given by 

[0,1[=L|W: = U 

iei iei 

and with transformation 


i i + 1 


T(x) = qx — [qx\ . 


The language is again C b and the compactification 7L b , the compact group of 
b-adic integers. The set of representations is the whole product space without 
the sequences ultimately equal to b — 1. 
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In the last part of this section we want to describe a popular extension 
of this method. It consists in changing the base b at any step: this method 
is called Cantor expansion. 

In this way, the 6-adic expansion can also be obtained beginning with the 
most significant digit, using the so called greedy algorithm. 

This procedure still gives rise to a numeration system, but not to a fibred one. 
Moreover, this way of producing expansions of nonnegative integers yields 
a more general concept than the Cantor expansion, the G-scale, which is 
the most general possible way of representing nonnegative integers based on 
the greedy algorithm. As we have seen, given a fibred numeration system 
Af, we can consider its associated A/’-compactification. For a G-scale, a 
compactification can be also built, but it is not possible in general to extend 
the addition from N to it in a reasonable way. Nevertheless, the addition by 
1 on N extends naturally and gives a dynamical system called odometer. 

Example 1.2.56 (/3-expansions and /3-adic van der Corput sequences). The 
basic idea to obtain fi- expansions is to replace b by any real number (3 > 1 . 
Then let X = [0, 1[ and Tp : [0, 1 [—> [0, 1[ be the (3-transformation defined by 
Tp(x) = fix mod 1. Moreover, let I = {0, 1,..., \fi~\ — 1}. The we have the 
following fibred system ([0,1[, Tp) 


[ 0 , 1 [-> [ 0 , 1 [ 


rN* 


-> /N* 


where the representation map ip of the fibred system is defined by 

CO 

x = S ° = (eo, d, • • •) € I m • 

n=0 ^ 

Parry m proved that the set of admissible sequences (p(X ) is charac¬ 
terised in terms of one particular fi-expansion. For x G [0,1[, set dp{x) = 
<p{X). In particular, let dp( 1) = (t n ) n > i. We then set d*p( 1) = dp(l), if 
dp{l) is infinite, and 

d*p( 1) = (h ... t m - 1, ( t m - 1))^ , 

if dp{ 1) = t\ ... is finite, with t m / 0. The set tp(X) of fi- 

expansions of real numbers in [0,1 [ is exactly 

<p(X) = {yelW :Vk>l,a k y^d*p(l) . 
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where u i(> in the sense of the lexicographical order. 

The set Xjy = </?( [0,1[) is called one-sided (3-shift. 

Numbers (3 such that dg{ 1) is ultimately periodic are called Parry numbers 
and those such that dg( 1) is finite are called simple Parry numbers. 

Then simple Parry numbers are those which produce improper expansions. 
To any element in Ay, we can associate the number 

OO 

= ^e n /r n_1 . 

n =0 

Then 4>g(x) E [0,1 [ and numbers with two expansions are exactly those with 
finite expansion 

V>g(ei • • • e k -ie k 0 u ) = tpg((e i . ..e k -i(e k - l)tfg(l)) • 

If (3 is a Pisot number, that is it is a real algebraic integer greater than 1 
such that all its Galois conjugates are less than 1 in absolute value, then 
every element of Q(/3) n [0,1 [ admits a idtimately periodic expansion (see 
e.g. lEBf). Hence (3 is either a Parry number or a simple Parry number. 

Now we can see how to construct (3-adic van der Corput sequences. These 
sequences, denoted by Ng, were introduced for the first time by Ninomiya M 
in 1997. Their construction is based on the (3-adic transformation and the 
fibred numeration system just defined. 

In order to define them we need to consider the following sets: 

• Xg(n) is the set of admissible sequences of length n, 

• w = 

. Fg{n) = jfXg(n), 

• G h( n ) = £?= o-*>(*) = # Y p{n)- 

Now for an arbitrary positive integer n, define l n to satisfy Gg(l n ) < n < 
Gp{l n + 1). Then, reverse the lexicographical order on the digits of Xp(l n + 1) 
for every n. Then the sequence Ng is defined as follows: 

Ng = {^p(“n-G 0 (l n )+i)} n=l ’ 
where oj ln+1 g Xg{l n + 1). 

In the following example with (3 = ^ +1 is given explictly. Let us derive 
the first elements of the sequence Ng by means of this method. 

When (3 is the golden mean v ^ +1 , we have that I = {0,1} and dg( 1) = 11(F. 
Hence the set tp(X) of admissible sequences is given by all sequences on the 
alphabet I such that no two consecutive digits of the form CiCi+i = 11 can 
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exist, for every i > 0. 

Now fix n = 4. By writing down the sequence of admissible digits in the 
right-to-left order 


1 

01 

001 

101 

0001 

1001 

0101 

00001 


we can see that Gp( 3) < 4 < Gp( 4). Then the first elements of Np with 
respect to n = 4 are 

Np(l) = fi~ l 
Np( 2) = /r 2 
Np( 3) = /5“ 3 
^(4) = /3- 1 +/3- 3 

lV^(5) = r 4 

iv^(6) = /r 1 + r 4 
iv^(7) = r 2 + r 4 

Ninomiya l69\ Theorem 3.1] also proved that when f3 is a Pisot number and 
all its conjugates belong to {z £ C : \z\ < 1}, then the fi-adic van der Corput 
sequence Np is low-discrepancy. 

Steiner 19^ gave an explicit formula for the discrepancy function of these 
sequences. In particular he showed that for Pisot numbers (5 with irreducible 
polynomial, the discrepancy function D(N, [0, y[) is bounded if and only if 
the fi-expansion of y is finite or its tail is the same as that of the expansion 
of 1. Moreover, if (3 is a Parry number, then he showed that the discrepancy 
function is unbounded for all intervals of length y ^ Q(/3). 

Definition 1.2.57. Let G = (G n ) n > o be an increasing sequence of positive 
integers with Go = 1. Then every positive integer can be expanded in the 
following way 

OO 

n = y^ £ k(n)Gk 1 , 

fc=o 

where £k(n ) e { 0 ,..., \Gk+i/Gk] — 1} and |"x] denotes the smallest inte¬ 
ger not less than x G M. This expansion (called G-expansion) is uniquely 
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determined and finite, provided that for every K 


K -1 

yi < G/f. (1-32) 

k=0 

For short we write £k for the k-th digit of the G-expansion. 

G = (G n ) n >o is called numeration system and the digits £k can be com¬ 
puted by the following greedy algorithm (for details see for instance [34]). 


Given a positive integer n, its greedy G-expansion is obtained in the 
following way: let i be the largest index such that Gj < n < Gj+i- Set 



Then n can be written as 

n = eiGi + n* , 

with 0 < m < G{. By iterating this procedure with m the expansion we 
obtain that the greedy representation for n is the string e^- 1 .. .eieo and 
we say n is greedily representable. 


It is easy to see that every non-negative integer is greedily representable 
if and only if Go = 1. If Go 1, it is possible for a number to be repre¬ 
sentable, but not greedily representable. For example, consider expressing 
12 in the numeration system G where (G n ) n >o is the sequence of prime num¬ 
bers (Go, Gi, G 2 ,...) = ( 2 , 3, 5,...). In this case we have that the largest 
index such that G; < 12 < Gj+i is 4, with corresponding G 4 = 11, but the 
procedure cannot go on. 

Note that if Go = 1, then the greedy representation is in fact the lexico¬ 
graphically greatest representation. A desirable property of any numeration 
system is that the mapping that sends an integer n to its representation 
is order-preserving. For more properties enjoyed by digits and numeration 
systems we refer to mm- 


We denote by Kg the subset of sequences that satisfy (1.32), i.e. Kq is 


the set of sequences e = eo^i • • • belonging to the infinite product 


n (G)= n(0,i,..,LGWG fe j-i} 


m =0 


satisfying (1.32) . The elements of Kg are called G-admissible. 


Although the G-scale is not fibred, one may consider its compactification, 
in the sense of the closure of the language in the product space 11(G). The 
set of nonnegative integers N is embedded in Kg by the canonical injection 
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n i-)- eo(n)ei(n)... €L{n)0 °°. 

Obviously K G is compact and it will be called the G-compactification of N. 
In order to extend the addition-by-one map defined on N to K G we introduce 
the set Kq C K G 

K° G = lxeK G : 3M x ,\/j > M x , ^ e k G k < G j+1 - 11 . (1.33) 

l fc=0 J 

Put x(j ) = 52{=o e kGk, and set 

t(x) = (e 0 (x(j) + 1)... £j(x(j) + l))e j+1 (x)e j+2 (x)... , (1.34) 

for every x E Kq and j > M x . This definition does not depend on the choice 
of j > M x . In fact, let l be the greatest integer such that x(l — 1) + 1 = Gi 
provided that such an l exists; otherwise there is no carry and we just add 
one to the first digit. Then for all j > l we have 

(,r(j) + l) = (e 0 {x(l) + 1)... ei(x(l) + l))xi+i... Xj 
= O l (xi + l)xi + i.. . Xj . 

So we can extended the definition of r to sequences x in K G \ Kq by 
r(x) = 0 = (0°°). In this way the transformation r is defined on K G and it 
is called G-odometer. We refer to m for a complete survey on odometers 
related to general numeration systems. 

In particular, the authors in m focus their attention to a particular family 
of G-odometers, namely G-odometers where the base sequence is a linear 
recurrence. In this case they can show that the map r is continuous and 
(K g ,t ) is uniquely ergodic. 

In the sequel we summarise the important steps of this result, that will be 
necessary in the proofs of Chapter 4. 

In the sequel we will consider sequences (G n ) ng N generated by a finite 
linear recurrence of order d + 1, where d + 1 is the period length: 

Gn+d+l = «0 G n +d + “lGn+d-l + ‘ ‘ ‘ + (&d + l)G«.j U > 0 . (1.35) 

and the initial values are given by Go = 1 and 

n 

G n +i = a n - k G k + 1 . 

k=0 

In [HI Theorem 5], the authors show that the odometer on an admissible 
numeration system G is uniquely ergodic, providing an explicit formula for 
the unique invariant measure /r. 
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Theorem 1.2.58. The odometer t is a uniquely ergodic transformation, 
i.e. there is a unique invariant measure /a given by 

g(Z) = (1.36) 

Fk,o /3 d-1 + (F K a — a 0 F K ^)fi d ~ 2 + ... + {F Kd _i — a 0 F K ^ d _ 2 — ... — a d - 2 F K ^ o) 

/3 K ((3 d - 1 +/3 d ~ 2 + ... + 1) ’ 

where Fk )T '■= #{n < Gk+ r ■ (eo(^), £i(n),...) £ Z} and Z is a cylinder 
with length k. 

Note that the formula in m Theorem 5] included a misprint and was 
stated in corrected form in [8]. 

In [3T] the author also prove that under a certain hypothesis the odome¬ 
ter has purely discrete spectrum. An analogous result was proved by Solo- 
myak [90] for linear recurrences with decreasing coefficients. 

Hypothesis 1.2.59 (Grabner, Tichy and Liardet |41|). There exists an 
integer b > 0 such that for all k and 

k OO 

N = 22 e i G i + 22 e i G T 

i=0 j=k+b+2 

the addition of G m to N, where m > k + b + 2, does not change the digits 
eo, • • •, Cfc, in the greedy representation i.e. 

k oo 

+ G m = 22 e iGi + 22 e J G T 

i —0 j=k-\-l 

Theorem 1.2.60. ICq is (measure-theoretically) isomorphic to a group ro¬ 
tation with purely discrete spectrum given by the countable group 

r := {z G C : lim z Gn = 1}. (1-37) 

n^oo 

provided that the above Hypothesis holds. 

This analysis of the spectrum will be very useful in Chapter 4 when we 
will consider the product of systems of the form We will prove that 

the product of these uniquely ergodic systems is uniquely ergodic, too. 


There are only a few results concerning this hypothesis. In [41] the au¬ 
thors remark that the Multinacci sequence, i.e. ao = ... = a d -i = 1, fulfills 
Hypothesis 1.2.59 Furthermore Bruin et al. [16] show that the numerations 


systems with coefficients (00,01,02) = (1,0,1) fulfills Hypothesis 1. 
Sometimes it seems easier to consider the following hypothesis introduced 
by Frougny and Solomyak m- 
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Hypothesis 1.2.61 (Frougny and Solomyak [ST])' The characteristic root 
fd of the numeration system G is a Pisot number such that all elements of 
the set 2.[i3~ 1 } have finite f3- expansions. 


Remark 1.2.62. Several researchers worked on algebraic characterizations 


of Pisot numbers f3 which satisfy Hypothesis 1.2.61. Solomyak m Main 


Theorem] showed that numeration systems defined by linear recurrences with 
decreasing coefficients fullfill Hypothesis 1.2.61 , proving an analogous of 
Theorem\1.2. 60 for this case. 


Furthermore Hollander 


states another sufficient condition for Hy¬ 
pothesis 1.2.61 and Akiyama |5] characterizes all Pisot units of degree three 
satisfying Hypothesis 1 1.2.6 f 


However there exists no complete algebraic characterization for Pisot num¬ 


bers satisfying Hypothesis 1.2.61 of degree greater than two. Note that both 
hypotheses can be satisfied by the same numeration system but, to the best 
of the author of this thesis knowledge, it is unknown if the two hypotheses 
are equivalent, see 














Chapter 2 

LS sequences 


In the first chapter we mentioned that the concept of uniform distribution 
and discrepancy applies also to sequences of sets and partitions. 

We presented the Kakutani splitting procedure and its generalization, the 
p-splitting, which produce uniformly distributed sequences of partitions. 

In this chapter we want to introduce the constrction of LS-sequences of par¬ 
titions. This is a countable family corresponding to pair of natural numbers 
L and 5 such that 5 > 0 and L + S > 2. For 5 = 0 and L > 2 the procedure 
reduces to the well-known sequences of partitions into b n equal parts, with 
b = L. These sequences of partitions have been introduced by Carbone PI- 
She proved that these sequences of partitions are low-discrepancy if and only 
if L > 5 and computed explicit asymptotic behaviour of the discrepancy of 
the sequences for L < 5. 

What is more interesting is that she presented an algorithm (improved in 
[18] ) which associates to any L5-sequence of partitions a sequence (called 
L5-sequence) of points and that for L > S they are low-discrepancy. For 
5 = 0 and L > 2 the algorithm provides the well-known van der Corput 
sequence of base b = L. 

2.1 LS'-sequences in the unit interval 

In this section we first give a description of the L5-sequences of partitions, 
obtained as subsequent p-refinerrients of the trivial partition ui. Then we 
describe the L5-sequence of points associated to the sequences of partitions. 

Definition 2.1.1. Let us fix two positive integers L and S, with L + 5 > 2 
and 5 > 0 and let 0 < a < 1 be the real number such that La + Sa 2 = 1. 
Denote by pl,s partition defined by L long intervals having length a and 
by S short intervals having length a 2 . By (pf s ) n &n we denote the sequence 
of successive pl,s- refinements of the trivial partition uj. This will be called 
the LS-sequence of partitions. 
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Example 2.1.2. Let us consider L = S = 1 and the corresponding 1,1- 
sequence of partitions (p" i) n eN whose characteristic polynomial is a + a 2 = 

1. Its solution in [0,1 [ is a = v/ ^ ) ~ 1 , i/ie inverse of the golden ratio. The 
first partition consists only of two intervals of length a and a 2 , respectively, 

i.e. 

Pi,i = {[ 0 .«[» [«> ![} • 

In order to construct the second partition, we divide the interval of maximal 
length proportionally to a and a 2 . Therefore 

pl ,i = {[0,a 2 [, [a 2 ,a[, [a, 1[} . 

At this point we have two intervals of maximal length, the first and the third, 
and we proceed by splitting them. Then we get 

Pi.i = { [0, a 3 [, [a 3 , a 2 [, [a 2 , a[, [a, a + a 3 [, [a + a 3 ,1[} , 


and so on. 


1 

0 



i 

a 

l 


i 

i 

_1 

0 


a 2 

1 

a 

l 

l 

1 

i 

l 

0 

a 3 

a 2 

1 

a 

I 

a + a 3 

1 

i 


Example 2.1.3. Let us consider the sequence of partitions {ptf i)neN corre¬ 
sponding to 2a + oi 2 = 1 . At the first step we have a subdivision of the unit 
interval into two long intervals and a short one, and we get the following 
partition 

P 2 ,i = {[0,«[, [a, 2a[, [2a, 1[} . 

In order to obtain the second partition we split the first two intervals, as 
follows 


P 2 ,i = {[0, a 2 [, [a 2 , 2a 2 [, [2a 2 , a[, [a, a + a 2 [, 

\ot -\- ey 2 , ot 2a 2 [, \ot -\- 2ck 2 , 2u[, [2 ex, 1 [} , 


and so on. 
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0 " 2a 1 


0 a 2 2a 2 a a + a 2 a + 2a 2 2a 1 


Let us note that every partition pf s produces only intervals of length 
a n (long) and of length a n+1 (short), respectively. If we denote by t n the 
total number of intervals of pf g, then the following linear recurrence holds 

t n = Lt n - 1 + St n - 2 , (2-1) 

with initial conditions to = 1 an d 11 = L + S. 

In fact, if we denote by l n the total number of long intervals of pf s and by 
s n the total number of short intervals, then the following relations hold: 

tn — In ~ I - S n , l n — Ll n —\ T Sn—1 i &n — Sl n —\ ■ 


This is beacause the n-th partition is obtained splitting the long intervals 
of the (n — l)-th partition into L long and S short intervals. The short 
intervals of the (n — l)-th partition remain untouched and hence become 
long intervals of the n-th partition. Formally: 


tn — ^n T Sn — Ll n _\ S n — \ T Sl n — i — 

— Lt n —1 “h 2 "h SLl n ~ 2 T Ss n — 2 — 

— L(Jn— 1 T ■Sn— l) T S{l n —2 T S n — 2 ) — 
= Lt n — 1 T St n — 2' 


From (2.1) immediately follows that the sequence (tn)ne hi the case 
L = S = 1, is the Fibonacci sequence. On the other hand we have seen that 
in this case (Pi i)neN is a Kakutani sequence. For these reasons, the cor¬ 
responding 1,1-sequence of partitions has been called in m the Kakutani- 
Fibonacci sequence of partitions. 


The following theorem shows that the LS-sequences of partitions are 
low-discrepancy if L > S. 

Theorem 2.1.4 (Theorem 2.2, jl9|)- i) If S < L there exist cj, c\ > 0 
such that 

c'i < t n D{pf s ) < ci for any n G N . 
ii) If S = L + 1 there exist C 2 , c' 2 > 0 such that 

c' 2 login < tnD{p r f s ) < c 2 login for any n G N. 
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in) If S > L + 2 there exist C 3 , c' 3 > 0 such that 

c' 3 tl l ~ r < t n D(pf s ) < c 3 t 1 ~ T for any n £ N , 


where r = 1 + 


log(gQ) 
log a 


> 0 . 


Observe that the same estimates for the discrepancy of the LS'-sequences 
of partitions have been reobtained by Drmota and Infusino [25], by applying 
Theorems 1.1.33 and |1.1.35 to this particular class of p-refinements. 

Let us now explain how to get the associated sequences of points of the 
corresponding LS-sequence of partitions, also introduced in [19]. 


Definition 2.1.5. Given the sequence of partition ^p£ s ^j , we define the 
LS-sequence of points (££ 5 )neN as follows. 

The first t\ points are just the left endpoints of the intervals of p l L s ordered 
by magnitude. This set of points will be denoted by A£ s and its points will 

be denoted by ..., For n > 1 and if A£ s = ..., is the 

set of the t n points (written in their order) of pf s , then the t n+ \ points of 
PI+ 1 are recursively ordered as follows: 


A n+1 — 
IV L,S ~ 




(d" 


An) 

> St„ ) 

An+l), 


: (n)^ 




,A n+l )tA n l: 

<Tl Isl, > 


'(Qr, 

AT AA, ■ ■ ■, AAAA,.... AAA ((A ,.... AAAAA) 


Here, l n is the number of long intervals of pf s , and the two families of 
functions are 


<p^ n+1 \x) = x + ia n+1 and <p^+ l \x) = x + La n+1 + j/3 n+2 
for 1 < i < L and 1 < j < S' — 1 . 

In [T9] it has been shown that, if L > S D(£} L s) = ® ( ^°n N ) ^ or 

n —> 00 and that to sequences of partitions (p£ with low-discrepancy 

correspond sequences of points (££ g) n eN with low-discrepancy. 

Theorem 2.1.6 (Theorem 3.9, [T9]), i) If S < L there exist k \, k[ > 0 such 
that for every N £ N 

k'ljj < d n(£,l,s>€l,S’ ■ ■ ■ ■ £l,s) < ^ • 


ii) If S = L + 1 there exist k?, k' 2 > 0 such that for every N £ N 


./ logiV 

'2 N 


< D n{€l,Si£l,S> ■ 



log 2 N 
N 
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in) If S > L + 1 there exist ks, k' 3 > 0 such that for every JVeN 


^3 N T -1 — D n{^L,S^L,Si ■ ■ ■ i£l,s) — ^3 


log AT 

JV ^ 1 


w/iere r = 1 + < 1 . 

As an example we show how to get the sequence of points associated to 
the sequences of partitions considered in the above examples. 


Example 2.1.7. We want to consider the Kakutani-Fibonacci sequence 
(Ci,i)neN- The set of points corresponding to the sequence of partitions p\ 1 
contains exactly the left endpoints of the two intervals [0,a[, [a,l[, i.e. 

A i,i = = (Ci,i)Ci,i) • 

We can observe that since we have only one short interval, then j = 0 
and therefore there are no functions of the type tpffj 1 ; but only the function 

(Pi t+1 ). Since l\ = 1, applying the function pp to the point zero we have 

A i,i = (0,a,^ 2 ) (0)) = (o,a,a 2 ) = (^1,1, ^1,1, <Ci,i) • 

and also 

A l,i = (o,a,a 2 ,<^i 3) (0),^i 3) (a)) 

= (o,a,a 2 ,a 3 ,a + a 3 ^ = (CiV> Ci,i, Ci, l, Ci,i> Ci,i) ■ 

Going on in this way and applying the function pp to the first three points 
(since I 3 = 3), we get Aj 1 as represented in the following figure. 


Cl Ct; Cl C.3 C* C 2 Cr Cr. 


0 oP oP oP oP + a 4 a a + a 4 a + oP 1 


Figure 2 . 1 : First 8 points of (Cf i)n>i 

By iterating this procedure we get the sequence of points (Cf i)neN associated 
to (Pl,l)neN- 

This sequence has been called Kakutani-Fibonacci in m and I2iy and it will 
be of particular interest in the next chapter, where we will consider a method 
to get this sequence as orbit of an ergodic transformation. 
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Example 2.1.8. Let us now considev the sequence of points Also 

in this case we do not have functions of the type but only of type 

ip[ n+1 \ with i = 1,2, to apply to the first l n points of the set K 2 1 . The first 
set of points is 

-A-2,1 = (o, a, 2 a^ = ^£2,11 £2,11 £2,1) • 

( 2 ) ( 2 ) 

Then, since l\ = 2, if we apply the two functions ip\ ' and ip 2 to the first 
two points of A\ ] we have 

A ii = (0,a,^S 2) (0),^S 2) (a),^2 2) (°)^2 2) («)) 

= ^0, a, 2a, a 2 , a + a 2 , 2a 2 , a + 2a 2 ^ 

= (^2,1! £2,1)^2,l>£f,I jCI.IjCs!,11^2,1) • 

Let us now compute the set of points of p 21 . 

Applying the functions and tp^ to the first five points of A 21 (since 
I 2 = 5 ), we have 

Agj = ^0, a, 2a, a 2 , a + a 2 , 2a 2 , a + 2a 2 , 

</4 3) (0)> <^S 3) («), (2a), p[ 3) (a 2 ), <^ 3) (a + a 2 ), 

</4 3) (0), <^2 3) («). </4 3) ( 2 «) > V 2 ] (« 2 )>^2 3) (« + « 2 )) 

= ^0, a, 2a, a 2 , a + a 2 , 2a 2 , a + 2a 2 , 

a 3 , a + a 3 , 2 a + a 3 , a 2 + a 3 , a + a 2 + a 3 , 

2a 3 , a 2a 3 , 2a -1- 2a 3 , a 2 2a 3 , a a 2 “f* 2a 3 ^ 



The first points of this point set are shown in the following picture 

Ci Cs C: C(i £2 £9 v. £7 £3 

I-1-1-1-1-1-1-1-1-1 

0 a 3 a 2 2a 2 a a + a 3 a + a 2 a + 2a 2 2a 1 


Figure 2.2: First 9 points of (C 2 'i)n>i 

More efficient ways of constructing the LS-sequences of points have been 
recently introduced in [4] and [ 18] . 
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2.2 LS'-sequence in higher dimension 


We now consider two possible bidimensional extensions of the LS-sequences 
of points, namely the LS-point set a la van der Corput and the LS-sequences 
a la Halton. These extensions have been proposed to me by I. Carbone and 
A. Volcic as the main object of my master thesis |50] and are contained in 
the unpublished paper [ 20 ]. 

Definition 2.2.1. An LS-point set a la van der Corput of order N E N in 
[0,1[ 2 is defined as 

*n=(^^L,s) n = 1,... ,N , (2.2) 


where (££ s)neN is an LS-sequence of points. 


Remark 2.2.2. By Lemma 1.1.55 , it follows immediately that if we choose 
a low-discrepancy LS-sequence (££ 5 ), then the corresponding LS-point set 
a la van der Corput is low-discrepancy in [0,1[ 2 . 


Definition 2.2.3. An LS-sequence of points a la Halton in [0,1[ 2 is the se¬ 
quence ((££ lSl ,££ 2 s 2 ))neN, where (^ l5l ) n6 n and (££ 2 s 2 )neN are two distinct 
LS-sequences of points. 


Both definitions can be extended to an s-dimensional sequence in two 
possible ways. The first one is to consider s-tuples of LS-sequences, while 
the second one is to consider s-tuples where the first coordinate is ($), and 
the remaining s — 1 are LS'-sequences. 

Of course, like for classical sequences already encountered in the first chap¬ 
ter of the thesis, such as the multidimensional Kronecker sequence and the 
Halton sequences, the main problem is to provide a condition (often alge¬ 
braic) to assure the uniform distribution property in the multidimensional 
setting. 

In fact, even pairing low-discrepancy LS-sequences, in general, does not 
guarantee uniform distribution in the unit square. The following two figures 
show how the behaviour of the bidimensional sequence can indeed be very 
erratic. 

A partial answer to this problem has been given by Aistleitner et al. 
in j3], where the authors show that if the roots cq, a 2 of the polynomials 
Six 2 + L\x — 1 = 0 and S 2 X 2 + L 2 X — 1 = 0 do not fulfill a precise condition, 
then the sequence ((^£ 1 5 l) ^£ 2 s 2 ))neN is not dense in [ 0 , 1 [ 2 . 

In the sequel we will denote by B = ((Li, Si),..., (L s , S s )) an s-tuple of 
pairs, each of them defining an LS-sequence of points and by (£g) n eN the 
corresponding multidimensional LS-sequence, obtained by combining them 
componentwise. 

More precisely, they proved the following 
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Figure 2.3: ((C”i, ^5,i))neN for 1 < 
n < 5000 



0 0.2 0.4 0.6 0.8 1 


Figure 2.4: ((^n €2,i))neN for 1 < 
n < 5000 


Theorem 2.2.4. Let B = ((Li, Si), (L2, S 2 )) with Li > Si — 1 > 0 and 

a k+1 

assume that there exist integers m and k such that G Q. Then the 

a i 

two-dimensional LS-sequence (£g) n eN n °t uniformly distributed, and not 
even dense in [0,1[ 2 . 

For instance, in the case (Li,S'i) = (1,1) and (4/2,52) = (4,1) (see Fig- 
we have a .2 = af and so the sequence is not even dense in [0,1[ 2 . 


This theorem can also be applied to the multidimensional case, since for 
any multidimensional sequence of points, which is uniformly distributed, all 
lower-dimensional projections also have to be uniformly distributed. More 
precisely, one has the following 

Corollary 2.2.5. Let B = ((Li, Si),..., (L s , S s )) with Li > Si — 1 > 0 
and assume that there exist numbers u,w G {1,... ,s} and integers m and 

fc+i 

k such that “ +1 G Q. Then the s-dimensional LS-sequence (£g) n eN is not 

Ot-yj 

uniformly distributed, and not even dense in [0, l] s . 

However, no positive results, proving uniform distribution of a bidimen- 
sional LS-sequence for an appropriate choice of L\, Si and L 2 , S2, are known 
so far. 


ure 


2.3 














Chapter 3 


A dynamical system 
approach to the 
Kakutani-Fibonacci sequence 


In this chapter we show how to obtain the Kakutani-Fibonacci sequence of 
points, introduced in Chapter 2, as orbit of an ergodic transformation. More 
precisely, we consider the Kakutani-Fibonacci sequence of partitions and as¬ 
sociate to it an ergodic interval exchange, which will be called the Kakutani- 
Fibonacci transformation. The construction of the interval exchange makes 
use of the cutting-stacking technique, that we already considered in Chap¬ 
ter 1. Moreover, we will prove that the orbit of the origin under this map 
coincides with the low discrepancy Kakutani-Fibonacci sequence of points, 


defined in Example 2.1.7 


The content of this chapter is presented in 


Definition 3.1.1 (Cutting-stacking technique for the Kakutani-Fibonacci 
sequence). For every fixed n, the intervals of the n-th Kakutani-Fibonacci 
partition a n oj are represented by two columns C n = {L n ,S n } constructed as 
follows. 

Let us start with two columns C\ = {Li,Si}, where L\ = [0,a[, Si = 
[a, 1[. If we divide L\ proportionally to a and a 2 , we can write L\ = 
{L\,L\}, where Lf[ = [0,a 2 [ and L\ = [a 2 ,a[. Now we stack the interval S\ 
over lf{ (and use the common notation Li*S\) as they have the same width, 
namely w(L\) = w(S i) = a 2 . So we get two new columns C 2 = {L 2 ,S- 2 }, 
where L 2 = L\* Si and S 2 = L\. We denote by b{L 2 ) = [0,a 2 [ the bottom 
of L 2 and by h{L 2 ) = 2 = I 2 its height. 

If we continue this way, at the n-th step we get two columns denoted by 
C n = {L n ,S n }, with h(L n ) = l n and h(S n ) = s n . We divide L n into two 
columns, say and Lwhere w{L^f) = a n+l and w (L^) = a n+2 , and 



A dynamical system approach to the Kakutani-Fibonacci sequence 


70 


stack S n over L^, obtaining 

Cn+l — {-^n+1;‘S'n+l}) 

where 


with 


and 


w 


L n +i — L n * S n and <SVi+i — C n , 


(Ln+i) = a n+1 , h(L n+1 ) = l n+ 1 , b(L n+ i) = [0, a n+1 [ 


w(S n+ i ) = a n+2 , h(S n+ 1 ) = s n+ i , b(C n+ 1 ) = [a n+i , a n [ . 


_ r n +1 nr 


The first steps of the procedure are visualized in Figure [3T 

L 2 


C 2 


Ci 

Li Si 


a + o? 1 S 2 


l 3 c 3 


S 3 


a + a J a + a' 


3 1 


Figure 3.1: Partial graph of the cutting-stacking method for (a n u:) n >i 


To the above cutting-stacking construction it is naturally associated an 
interval exchange, whose explicit expression is given by the following result. 

Proposition 3.1.2. The interval exchange corresponding to the cutting¬ 


stacking procedure described in Definition 3.1.1 is the map T : [0, l[-> [0,1[ 
whose restriction to Ik is Tk, where 

Ti(x) = x + a if i£/i = [0,a 2 [ 


and, for every k >1, 


k -1 


T 2k (x) = x + a 2k - a 2j+1 if x G I 2k = 

j =0 


k -1 


£« 2)+1 .£ 

j =0 j =0 


a 


21+1 
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and 


T 2k+1 (x) = x+a 2k+1 


k -1 


-J2^ 2U+1) 

3=0 


if x E I 2 k+ 1 


x;« 2 (, ' +i) ,E“ 2 °’ +1) 

1=0 1=0 


Proof. If we write T(x) = x + c k whenever x E I k for any k > 1, we 
simply observe that I k + c k = [a k ,a k + a k+l [ for all k > 1. Therefore, 
A(Ufc>o(/fc + Cfc)) = 1 and (/^ + c/J n (/*, + c*,) = 0 whenever h / k, which 
proves that T is an interval exchange. 

Now we show how the map T acts in the cutting-stacking procedure, 
proving that in each column C n = {L n ,S n } the transformation T maps 
each interval of L n (respectively, S n ) onto the interval above it and the top 
interval of L°, denoted as usual by top(L^), onto the bottom interval of S n . 

Let us start with C\ = {-Li,Si}. When we divide the column L\ into 
two sub-columns L® and L \, made by one interval each, we notice that 
top(L\) = I\ and b(S\) = T(/i). Therefore, we stack Si onto L\ using Ti, 
which maps I\ onto b(S i) and gives birth to the columns C 2 = {L 2 ,S 2 }, 
where L 2 = L\* S\ and S 2 = L\. 

Now we consider n = 2k and C 2k = {L 2k , S 2k }. When we divide L 2k 
proportionally to a and a 2 , obtaining therefore L® k and L\ k , we notice that 
b{S 2k ) = T 2k (I 2k ) = [a 2k ,a 2k+1 [ and, consequently, top{L^ k ) = I 2k because 
T 2k is a bijection. In other words, T 2k stacks the bottom of S 2k onto the top 
of L° 2k because T 2k (t.op(L^ k )) = b(S 2k ) = [a 2k ,a 2k ~ 1 [ . 

A simple calculation shows that if we consider the case n = 2k + 1 we 
have T 2fc+ i(/ 2 fc+i) = b(S 2k+ 1 ) = [a 2k+1 ,a 2k [ and, therefore, top(L% k+1 ) = 
hk+i- 

This completes the proof of the proposition. J3 


Definition 3.1.3. The map T : [0,1 [—> [0,1[, whose expression is given in 
Proposition \ 3.1.2, is called the Kakutani-Fibonacci transformation. 


Figure 3.2 shows the graph of the maps T k . with 1 < k < 4. 


It is worthwhile to compare the Fibonacci transformation Tp intro¬ 
duced in [32] (that we have already considered in the hrst chapter) and the 
Kakutani-Fibonacci transformation just defined. Both transformations are 
obtained by means of a cutting-stacking technique and at each step of the 
cutting-stacking procedure, the two columns obtained by means of the Fi¬ 
bonacci transformation have the same height and width as the two columns 
obtained by means of the Kakutani-Fibonacci transformation T. Neverthe¬ 
less, the bottoms are different as well as the order of the intervals in each 
column. Moreover, as we have already pointed out, the Fibonacci transfor¬ 
mation is not defined in 0, while the Kakutani-Fibonacci transformation is 
defined in 0 and, as we will show later, its iteration in 0 gives rise to the 
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Figure 3.2: Partial graph of the Kakutani-Fibonacci transformation T 


Kakutani-Fibonacci sequence. 

If we extend, as we did in Chapter 1, Tpib to 0 by continuity letting Tpib( 0) = 
a, we see that the extension of Tpib coincides with the rotation on [0,1[ mod 
1 by 

Theorem 3.1.4. The Kakutani-Fibonacci transformation T is ergodic. 

Proof. The arguments we use are inspired by [36]. 

Let us denote by C ^ the stack made by L m * S m and let us define the 
mapping r m : [0, l[-» [0,1[ as follows. For 1 < i < l m — 1 it is the translation 
of the interval Jj of L m onto J t +\ , the interval just above Jj, and hence it 
coincides with T. If i = l m , it is a contraction by parameter a of top(L m ) 
onto b(S m ). For 1 < i < s m — 1 it is the translation of the interval J* of S m 
onto ■J l +\, the interval just above and hence it coincides with T again. 

Now we fix a measurable set B in [0,1[ with A (B) > 0, such that 
T _1 (H) = B, and we prove that A (B T ) = 1, where B T = Uit^oo T l {B). 

From Lebesgue density theorem, there exists a point xo having density 
1 for B. This implies that for every fixed e > 0 there exists 5 > 0 such that 
if I is any interval containing xq with A (I) < 6 , we have 

A(Bnl) > (1 -e)A(J). (3.1) 

Due to the fact that the diameter of the Kakutani-Fibonacci sequence of 
partition (a m uj ) rn >i tends to 0 when m —> oo, there exists m > 0 such that 
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ot m < 5 and, therefore, there exists an interval of for which (3.1) holds. 
For sake of brevity, we will denote it by /. 

As C* n = L m * S m , the interval I could belong to the column L m or to 
the column S m . If I E S m , it is equivalent to say that / E L m+ For this 
reason, without any loss of generality we may suppose that I E L m . 

Suppose now that I is the r-th interval from below in the column L m 
and observe that 

/ t m -r \ 


x U r -( J ) = L 




Taking the above identity and (3.1) into account, we have the following 
inequalities: 

A {B t ) > A ((Bn/f) 

/ lm T \ / Sm 1 

A |J T(Bn/) +A \J (T j (B n b(S m ))) 


> 


\i=l—r 
( r 


3=0 

tm-r 


= A U 4(sni) +A U 

\i=l-r J \j=i m +1—^ 

lm T tm T 

= ^A(r;(Bn/))+ Y, A (ri(8n/)) 

i=l-r j=l m +l~r 

> (1 — e) l m a m + (1 — e) s m a. m+1 = 1 — e. 


As e is arbitrary, we conclude that A (B) = 1. Therefore, T is ergodic and 
the theorem is proved. □ 


A direct consequence of the above theorem and Birkhoff’s Theorem (The¬ 


orem 


1.2.11) is the following 


Theorem 3.1.5. The sequence (T n (x)) n > o is u.d. for almost every x E 

[ 0 , 1 [. 


We now show how to get the points of the Kakutani-Fibonacci sequence 
by means of the Kakutani-Fibonacci transformation. 


Theorem 3.1.6. The Kakutani-Fibonacci sequence of points { ) n >\ co¬ 
incides with (T n (0))n>o• 

Proof. We want to show by induction on n > 1 that 


(£i, 6,..., 6n) = (0,T(0),T 2 (0),.. ,,T tn ~ 1 (0)). (3.2) 


If n = 1, (3.2) is obviously verified. 
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We 


suppose that (3.2) is true for n and prove that 

(6> &,■■■, &„>0n+i(£i),• • •,0Ti+i(£/„)) 

— 1n TYm r rtn(r\\ T^+DrA 


Due to the definition of the set A” ( of points of the n + 1-th Kakutani- 
Fibonacci sequence of partitions (see Example 2.1.7) and the inductive as¬ 
sumption, it is sufficient to prove that 


(&»+l(0), </>n+l(T(0)), . . . , 4> n+l (T^-\ 0))) 

= (T tn (0), T*" +1 (0),..., T tn+ln ~ 1 { 0)). 

As (j) n+ i(x) = x + ct n+1 , the above identity is equivalent to 

(a (n+1 ),T(0) + a (n+1) ,...,T t "- 1 (0) + a (n+1) ) 

= (T* n (0),T tn+1 (0),... ,T tn+ln ~ 1 (0)). (3.3) 


In order to prove (3.3), we focus our attention on the intervals of the 
column S n +i and on their left endpoints. 

We note that, due to the cutting-stacking procedure described in Def¬ 
inition |3.1.1 and to the nature of T described in the proof of Proposi- 
and specifically to the fact that b(S n +i) = T(top(L n+ 1 )) = 
[0,a ri+1 [ )), the columns C n +1 = {L„+i, 5 n +i} can be writ- 


tion 3.1.2 

^ ^pln~\~Sn~ 1 

ten as follows: 


L n +1 = L° n * S n = (b{L° n ) * • • • * top(L^)j * (b(S n ) *■■■* top(S n )j 

= ([0,a n+1 [ *•••* T ln ~ l { [0, a n+1 [ ) ) * 

* (T ln ( [0, a n+l [ ) *■■■ * T ln+Sn ~ 1 ( [0,a n+1 [)) 


Sn+l = ^ 


= b(Li) *■■■* top(Ll) 

= {[a n+ \ a n { * • • • * T /n_1 ( [a n+1 ,a n [ ) ) 

= (r tn ( [0,a n+1 [ ) *••• * T tn+ L~ l ( [0,a n+1 [ )). 


Therefore, as T is an interval exchange, the t n left endpoints of L n+ i are 
0, T(0), • ■ ■ ,T t " _1 (0) and the left endpoints of S' n+ i, which are a right shift 
by the constant ot n+l of the le ft en dpoints of L n+ i, are T tn (0), T* n+1 (0), 
... ; T* n+s,1_1 (0), which proves (3.3). 

The theorem is now completely proved. □ 
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Remark 3.1.7. As we have already pointed out, the Kakutani-Fibonacci 
sequence of points (£ii)n>i is an example of uniformly distributed sequence 
(in fact low-discrepancy). Unfortunately, this property can not be detected 
by this ergodic approach, since Birkhoff's Theorem (Theorem 1.2.11) implies 
the uniform distribution of the orbit ( T n x) n only for almost every point 
x G [0,1 [ and not for every x 6 [0,1[. 


We will see in the next chapter that the Kakutani-Fibonacci transfor¬ 
mation is not only ergodic, but in fact uniquely ergodic and therefore we 
can prove the uniform distribution not only of the Kakutani-Fibonacci se¬ 
quence, that was already known to be so, but of all orbits ( T n x) n e pj, for 
every x E [0,1[. 







Chapter 4 


Ergodic properties of /3-adic 
Halton sequences 


In this chapter we will construct a parametric extension of the classical s- 
dimensional Halton sequence, where the bases are special Pisot numbers. 
We use methods from ergodic theory in order to investigate the distribu¬ 
tion behavior of such sequences constructed as orbits of uniquely ergodic 
transformations. As a consequence of the main result, we show that the 
Kakutani-Fibonacci transformation, introduced in the previous chapter, is 
uniquely ergodic. 

The content of this chapter is presented in [46] . 


Our first goal is to prove that the product of systems of the type (JCq, t) 
considered in Chapter 1 is uniquely ergodic. 


Theorem 4.1.1. Let G l ,...,G s be numeration systems given by (1.35) 
and let the coefficients of the linear recurrences be given as a*- = 6*, j = 
0,..., (di — 1), i = 1,..., s, with pairwise coprime positive integers bi, i = 

Qk 

1,..., s. Furthermore let ^ Q, for all l, k G N, where j3\, ..., /3 S denote 

r' 7 


the characteristic roots of the numerations systems. Then the dynamical 
system which is constructed as the s- dimensional Cartesian product of the 
odometers, i.e. (() x ... x ( JCg s ,t s )), is uniquely ergodic. 


Proof. It follows by Remark 1.2.62 that each numeration system G J , j = 

l , ..., s fulfills Hypothesis 1 1 . 2 . 6~H thus the components of the s-dimensional 
dynamical system are uniquely ergodic. By Theorem 1.2.50, we derive that 
the Cartesian product is uniquely ergodic if and only if Tj n = {1} for 
all 1 < j < k < s, where we denote by T* the spectrum of r*. As noted in 

m, we have 
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where the constant Cj can be computed by residue calculus. Using the 
standard notation ~ for asymptotic equality (if n —> oo) we obtain for fixed 
l GN 


L .Gh 

exp 2 tti — r 

l A. 


exp ( 2TriCj/3 i 


n—l 


~ exp | 2iriG J n _ l , , 


and thus 


lim exp ( 2-rr j = lim exp (27r iG 3 i ) = 1 


3 


where Cj is given by (4.1). Furthermore, it is easy to see that for every 


k G N there exists an no with | Gh for all n > no and there exist no 
b', n' 0 G N with gcd(6', bj) = 1 such that b' \ Gh for all n > n' 0 . Then Tj can 
be written as 


Ti = 


exp 2iri 




: m,l,c € N U {0} 


This completes the proof since Tj n T^ = {1}. 


□ 


Now we want to construct an isomorphism between (/Cg, r) and ([0,1[, T), 


where T is conjugate to r, i.e. T = <f>a o t o 


L-l 


The construction recalls 


the one considered for (Z j,,t) in Example 1.2.14 


The first step consists in extending the definition of the Monna map to 
irrational bases (3 > 1. Let 

n = Yl G( n ) G j 

j> o 

be the G-expansion of an integer n. For short we write and define the 
/3-adic Monna map (j)R : Kq —> K + as 


n) = 


J2G( n )G] =J^e i (n)/3 J 1 

J>o ) j> o 


Furthermore, we define the radical inverse function as the restriction of (j)p 
on ICq and similarly we define the pseudo-inverse (/>// 1 . In this context the 
/3-adic Halton sequence is given as (</>^(n)) ne N = (^(n),..., ^/3 s (n)) ne N, 
where (3 = (/3i,..., /3 S ) and the /3* are characteristic roots of the numeration 
systems G l , respectively. 


Note that even if one of the Hypotheses 1.2.59 or 1.2.61| holds, this does 
not imply that the image of ICq under <f>p is contained in [0,1[ and dense in 
it. 
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Lemma 4.1.2. Let a = (ao,..., where the integers ao,..., a ( i-i > 0 

are the coefficients defining the numeration system G and assume that the 
corresponding characteristic root ft satisfies 

P ~ “° + + ••• + ^d=I’ ( 42 ) 

where ao = \_P\. Furthermore, assume that there is no b = (6 q, ■ • •, bfc-i) 
with k < d such that fi is the characteristic root of the polynomial defined 
by b. Then (/>p(N) C [0,1[ and N) (jL [0, x[ for all 0 < x < 1 if and only 
if a can be written either as 

a = (ao, • • •, ao) or as (4-3) 

a= (a 0 ,a 0 -l,...,a 0 -l,a 0 ), (4.4) 


where ao > 0. 


Proof. It follows from (4.2) that 


«0 0,d -1 

fi d 


(4.5) 


Furthermore for all admissible representations (eo,£i,£ 2 > • • •) we have 


feb £ k- 1 , ■■■■,£ o, 0°°) < (ao, ai,..., ad- 1 )°° (4.6) 


for every k, when < denotes the lexicographic order. 


We consider a representation given by c = (co,..., Cfe_i)°° for k > 0 
and assume that there are no positive integers c( and m < k such that 
c = (eg,..., c / m _ 1 )°°. We obtain 




i 


2 “ + .. 

P 

P 


+ 


+ 


Cfc-l 


P k 


1 _ L 

1 ak 


Cfc-l 


P k 


fi k ) - 1 


To finish the proof of the lemma, the following will be shown: the maximum 
of (f>g{c) (extended over all representations c) is 1, provided that (4.3) or 
(4.4) are satisfied, i.e. 


CO Cfc_ 2 Cfc_i + 1 

P + ''' + fi k ~ l + fi k 


(4.7) 
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We have that k > d since a is by assumption the minimal representation 
of /3. In the case k = d we get 


co 

p 


Cd-2 Cd -1 + 1 

' + "d -1 + 


/3 ( 


/3 d 


«0 Od-l 

=0 + - + ^T 


= 1, 


thus c 0 = ao,...,Cd-2 = ad-2 and Cd-i + 1 = ad- 1 - Moreover, by (4.6), 


we have rnax I=0 / j_i a < a o and thus ao = max(maxj =0) ... j d _2 a*, a^-i — 1). 
Let m > 0 be the maximal integer such that a* = ao for i < m. Then, 
if ad -1 = ao + 1 we obtain that c = (ao, a \,..., a^- 2 , ao)°° contains m + 1 
successive ao’s, thus c is not admissible and ao = maxj = o,...,d-i a*. Similar 
arguments yield k < d since by assumption c / (c 7 0 ,..., c^^) 00 for m < k. 


Assume now that k = d, ao = maxj = o,...^_i a* and let m > 0 be defined 
as above. By (4.5) and (4.6) c* = (ao,..., a m , (ao — 1, ao,..., a m )°°) is 
admissible and 

M c *) > !. 


where equality holds if m is either 0 or d — 1 which are the cases (4.3) 


and (4.4). This yields the assertion (4.7), thus the proof of the lemma is 
complete. □ 

If we drop the condition that d has to be minimal, we obtain the following 
additional cases for which the above lemma is satisfied: 


a — (®o, • ■ •, ao, ao + 1) 


and 


.aha") 


(4.8) 

(4.9) 


a = (a', 

where ao > 0, a 7 , a" are of equal length and of the form 

a (ao, • • *, a o, ao 1), a (ao, • • •, ao) or 

&/ = («o, ao — 1, ■ ■ •, ao — 1), a 77 = (a o, ao — 1,..., ao — 1, ao). 

Note that (|4.8[) is another way to represent the (ao + l)-adic number system, 


which is a special case of (4.3) and obviously fulfills the lemma. Furthermore, 


condition (4.9) is a reformulation of (4.3) and (4.4), thus in the sequel we 


only consider numeration systems which satisfy (4.3) or (4.4). 


Lemma 4.1.3. Let G be a numeration system of the form (1.35), assume 


that the coefficients of the linear recurrence are given by aj = a, j = 
0 ,,(d — 1), for a positive integer a and let (3 denote the corresponding 
characteristic root. Then n(Z) = A {(f)p{Z)) for every cylinder set Z. 

Proof. Let the cylinder set Z be defined by the fixed digits eo, 

Assume first that ek-i < a, then F^ +r = (a + l) r for 0 < r < d. Thus, by 


(1.36), we obtain that 


kz) = r k - 
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Consider the /3-adic Monna map of n E N, i.e. 


<M n ) = 




1=0 


p 


i +1 ' 


If €k~i < a we easily see that (Z) is dense in 

(ffc-i + 1) 


I = 


7c-1 k—2 

e,; \ - e,; 


E^tt-E 


Z^ 1 


+ 


0* 


and that 4>p{ x ') $ I H x' $ Z. Thus (f>p(Z ) is A-measurable and A (4>p(Z)) = 

a (/) = r fc - 

Assume now that Z is defined by the fixed digits eo,..., ek -2 and ek-i = 
a. By the above argument we derive that a cylinder with fixed digits 
eo, • • • , efc_ 2 , and ek -2 < cl, has measure /3 _ ( fc_1 b 


Now compute the measure of Z\ 

n(Z) = _ ( a - l)/3~ k . 

Next we consider cj)p(Z), hence 

(ffc-2 + 1) 


M z ) = TI’Z/ 

.z 

and thus X((j>p(Z)) = fi{Z). 


7c—1 


k —3 


fli+l ’ fii+l 

U=o i=o ^ 


+ 


P k 


Let 2 < h < min(fc, d — 1) and consider a cylinder set Z with fixed 
digits eo,..., ek-h-i < a and €k~i = a for l = 1,..., h. Then, the cylinder 
with fixed digits eo,..., ek-h-i has measure p~^ k ~ h ) and every cylinder with 
digits eo,..., ek-h+i has measure p -( k ~ h + 2 ). Thus it follows 

n{Z) = - (a - \)p-(k-h+2)' 

Considering 4>p(Z) we have 


M z ) = 


h—h +1 


k—h 


E fc z fc z 

tfz+l ’ tfz+l 
L z=0 ^ i= 0 ^ 


(ffc-h-l + 1) 


P 


h—h+2 


and therefore A (cj)p(Z)) = n(Z). 


H 


As mentioned above, a result of Frougny and Solomyak isa Lenrnra 3] 
implies that the dominant root of 

x 2 — clqx — ai, ao, fli E 1, 


is a Pisot number if and only if a o > a\. Lemma 4.1.2 shows that the image 
of JCq under (f>$ is not a subset of [0,1[, when ao > ai. It follows that Lemma 
4. 1.3| characterizes all van der Corput-type constructions for d = 2. 
























Ergodic properties of /3-adic Halton sequences 


81 


Theorem 4.1.4. LetG l ,...,G s be numeration systems as in Theorem f.1-1 
and let P\,p s denote the roots of the corresponding characteristic equa¬ 
tions. Then the s-dimensional /3-adic Halton sequence (q n is u.d. in 

0- i;“- 


Proof. By Lemma 4.1.3 and the definition of the Monna map we obtain an 
isometry between the dynamical systems (( JCqi,t \) x ... x (K.g s ,t s )) and 
(([0,1[, T\) x ... x ([0,1[, T s )) where 


Tc. [ 0 , l[-4 [ 0 , 1 [, Ti(x) := </>& on o (j)^{x). 

Let Tx = (T]X\, ..., T s x s ) for x = (aq,..., x s ) G [0, l[ s . Hence by Birkhoff’s 
ergodic theorem, (T"x) ne p;j is u.d. in [0, l[ s for all x £ [0, l[ s . In particular, 
0/3( n ))«6N = (T n 0) neN is u.d. □ 


Note that the classical b-adic Halton sequence with pairwise coprime 
integer bases b \,..., b s > 2, is a special case of Theorem |4.1.4| 

Theorem 4.1.5. Let the numeration system G be defined by the coefficients 
( 00 , 01 , 02 ) = (1,0,1), let f be its characteristic root and r the odome¬ 
ter on G. Then p{Z) = A (4>p(Z)) for all cylinder sets Z. Thus T{x) = 
4>p o t o <f~^ l {x) is uniquely ergodic and (T n x) ne ^ is u.d. for all x in [0,1[. 
Furthermore the spectrum of T is given by 


r = <j exp ( 27 Ti-~i ) : m,I,ceffU {0} 


(4.10) 


Proof. It is well known that p is a Pisot number and equation (4.2) holds 


since |_/3J = 1 = ao- Hypothesis 1.2.59 was proved for this case in [16» The¬ 
orem 4], The proof that Hypothesis 1.2.61 is fulfilled can be found in [5j 


Theorem 3]. Equation (4.10) follows from the proof of Theorem 4.1.1 


Now we have to prove that dp transports the measure // to the Lebesgue 
measure on [0,1[. First we assume k > 3. Let the cylinder Z be defined by 
the fixed digits eo,..., ek-i- We consider four different cases. Suppose first 
efc -3 = e k -2 = efc-i = 0. Then F kfi = 1 ,F M = 2 ,F kj2 = 3 and we get by 
(1.36) that 


p(Z) = p 


-k 


Furthermore, by the same argument as in the first part of the proof of 
Theorem 14.1.41 we obtain 


Mz) 


k -1 


k—2 




L*=o 


pi+ l 5 Z-j pi +1 


+ 


i =0 


(ffc-1 + 1) 

P* 


and thus A (cf)p(Z)) = P k . 

Now let e k -3 = 1 Hk -2 = e?c-i = 0. Hence F kfl = l,F k)1 = 2 ,F kt2 = 3 and 
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n{Z) = (3 k . We have 


M z ) = 


k -1 


k-l 


L *=0 

fe-1 


i =0 




E^T’Esh + ^E^ 


—(3i+l) 


2=0 


fc—1 

V — V — + P~ k 

Z_^ Ri +1 ’ Z_^ fli+1 ~ ^ 


/3' 




thus again A (<fip(Z)) = f3 k . Now assume e ^-2 = l,efe-i = 0. Hence 
F k , 0 = 1, Fk t i = 1, Fk ,2 = 2 and 


KZ) = p 


-k 


P~ Z + 1 


/3- 2 + /?-! +r 


Similarly as above we obtain 

^(Z) = 


k —1 k —1 


E#r.E#rH-r<‘ +1 »E^ (3l+I> 

L 2=0 ^ 2=0 ^ 


2 = 0 


fc—1 k— 1 

e,: \ - 


V 2 V 8 4- «-( fc + 1 ) 

Z_^ fli+l ’ Z_^ 42+1 1 

L 2=0 H 2=0 y 


thus A (4>/3(Z)) = (3 <A' +1 ). Now we have 


P 


—(fc+i) _ p-k 


P 


-2 


1 


^- 2 + / 3- 1 + l 


which is equivalent to /3 -3 + /3~ 2 + fj~ l = (3~ 2 ~\~ \ and holds since (3 is the 
characteristic root of the polynomial x 3 — x — 1 = 0. 

In the last case we assume ek-i = 1, thus F^ q = = 1 and 


M(Z) = f3 


-k 


1 


/3-2 + /3- i + r 

As above we get \{pp{Z)) = /3 - ( fc + 2 ) and the result follows since 

1 


P 


— (fc+2) _ /c 


/ 3-2 + / 3- 1 + l 


is equivalent to/3 3 + /3 1 = 1. The cases where k < 3, follow by the same 


arguments. 


□ 


As a consequence of Theorem 1.2.50| we can construct uniformly dis¬ 
tributed two-dimensional sequences (^(n), where f3\ is the 

characteristic root as in Theorem TL5J @2 is the characteristic root of a 


numeration system as in Theorem 


4.1.1 


Qk 

and ^ Q for all integers k, l > 0. 
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Theorem 4.1.5 extends the examples given in [7* Proposition 13,14], 
where the authors consider G-additive functions which lead to u.d. point 
sequences in the unit interval. Furthermore, it is possible to show that the 
one-dimensional point sequence in the previous theorem is a low-discrepancy 
sequence by mimicking the proof for the 5-adic van der Corput sequence, see 
e.g. mug Eg. 


In the last part of this chapter we want to show that the Kakutani- 
Fibonacci transformation (see Definition 3.1.3) is in fact uniquely ergodic. 
Hence, with this different approach we can show that the orbit of x under 
the transformation is u.d. for every x £ [0,1[. 


For proving unique ergodicity of the Kakutani-Fibonacci transformation 
we need the following lemma. 

Lemma 4.1.6. Let (3 be the golden ratio 1 . Then we have Tx = fp o 
t o cfho^x for all (3-adic rationals 


k 



with coefficients e* £ {0,1}. 


Proof. First let us observe that a = q. It follows from [4] Lemma 3] spe¬ 
cializing L = S = 1 that every positive integer n has a representation of the 
form 

N 

n = ^2 eiU , 

i=0 


where ti is the number of total intervals of the i- th partition. This implies 
that qrvT- We want to show that Tx = <fpo t o (pf l (xi). Then 

we need to see if x belongs to I\, L^k or I- 2 k+\- 
then x = pM- 


If x £ h = 


0 — 

U > ft 2 


In this case 


and 


Ti(x) =x + ^ 


1 , Y' e j ( x ) 

p “ P j+1 


(fpOTO 4> p 1 {x) 


o t (00e 2 e 3 ...) 
0/ 3 ( 10 e 2 e 3 ...) 

OO 

1 + 

p p 2 P j+1 
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Let us fix /c > 1. If x G I 2 fc = 
Ej-,1 0,21+1 + ES+2 ■ Hence 


EtJa 2 ' + 1 ,ELo « 2<+1 


, then x = 


T 2 fe(x) = x + a 2k - ^2 oc 2j+1 

3=0 

k— 1 oo k— 1 

c ^+V* _ £ ^+1 
j=0 2fc+2 j=0 

oo 


^a 2j,+1 + ^2 eja J ' ~a~~ — > cr 


E 

2fc+2 


J+l«2fc 


eja J a 


and 


°T°^ (x) = 4>/3 ° T (1010 . . . 0100e2fc+2C2fc+3 • • • ) 
= ^(01101... 0100e2fc +2 e2fc + 3...) 

= • • • = ^(0 • • • 01100e2fc+2e 2 fc+3 • • •) 
= <^(0 ... 010 e 2 fc + 2 e 2 fc +3 • • •) 

^ OO 

1 Cj 

«2fc+ pj +1 

^ j=2fc+2 ^ 


= a 


2k 


+ E 

j=2fc+2 


e,-q 


j'+i 


Finally, if x G / 2 fe+i = Ej=o « 2(j+1) ; Ej=o a 2(j+1) 


, then x = Ze=o a 2 ^" +1 ) 


+ ES+3 e i« J+1 - Hence 


k -1 

2fc+l \ " „,2(j+l) 


and 


E 2fc+ i (x) = x + a 2fc+i - y^a 

j=o 

OO 

^ e j a j+1 + a 2fc+1 
2fc+3 


o r o (t>2(x) = $0 ° T (0101... 0100e 2 fc+3e 2 fc+4 • • •) 


4>p {HOI • • • 0100e 2 fc + 3e 2 fc+4 • • •) 

■ ■ ■ = 4>p {0 • • • 01100e 2 A: + 3e2fc + 4 ...) 
^(0 • • • 010e 2 fc + 3e2fc+4 • • •) 

l 00 

fc+T+ Z 


Q2k+1 Z—c Qj+1 
^ j=2k+3 ^ 


or ~ 1 ~ H- y ' e^q 1 


2k+ 1_l. 5 Qji+i 

j=2k+3 
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So we have proved that the two maps are the same on the set of all /3- 
adic rationals. Combining this with the ergodicity of the Kakutni-Fibonacci 
transformation we complete the proof. □ 


Theorem 4.1.7. The Kakutani-Fibonacci transformation is uniquely er¬ 
godic. 


I 

tin 


Proof. By Theorem 4.1.41 cfp o t o <f>o l is uniquely ergodic, thus by Lemma 


|4.1.6| and the density of the /3-adic rationals T is uniquely ergodic. 


□ 









Chapter 5 

Integration with respect to 
copulas 


As we have already highlighted in the first chapter, there is a strong connec¬ 
tion between uniformly distributed sequences and the estimate of integrals, 
especially of multidimensional ones. In this last chapter we want to fo¬ 
cus on the integration of two-dimensional functions with respect to copulas. 
We already know that a copula is the asymptotic distribution function of a 
two-dimensional sequence (, x n ,y n ) n with (x n ) ne ^, (y n )n£ N are uniformly 
distributed sequences. We now provide the general definition and we want 
to give upper and lower bounds for these integrals. This can be done by 
drawing a connection to linear assignment problems. These are a family of 
problems consisting in how to assign n items to other n items in an optimal 
way. We will discuss this kind of problems and the way to solve them, show¬ 
ing how we can apply the solution for these problems to the original problems 
of finding bounds for integrals. Of course the second type of problems is dis¬ 
crete, thus this approach gives rise to an approximation technique. Finally, 
we apply our approximation technique to problems in financial mathematics 
and uniform distribution theory, such as the model-independent pricing of 
first-to-default swaps. We refer to [eq m for a complete introduction to 
copulas and to m for a comprehensive treatment of assignment problems. 

Definition 5.1.1 (Copula). Let C be a positive function on the unit square. 
Then C is called (two)-copula iff for every x,y E [0,1[ 

C(x, 0) = <7(0, y) = 0, 

C(x, 1) = x and (7(1, y) = y 

and for every x\,X 2 , yi,y2 £ [0,1[ with X 2 > x\ and y 2 > y\ 

C(x 2 ,y 2 ) ~ C(x 2 , yi) - C(x 1 ,y 2 ) + C(x l ,y l ) > 0. (5.1) 

A function which satisfies ( |5.1| ) is called two-increasing or supermodular. In 
the sequel we denote by C the set of all two-copulas. 
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The word copula was first employed in a mathematical or statistical sense 
by Abe Sklar (1959) in the theorem (which now bears his name) describing 
the functions that join together one-dimensional distribution functions to 
form multivariate distribution functions. 

Sklar’s Theorem, which will be stated below, elucidates the role that copulas 
play in the relationship between multivariate distribution functions and their 
univariate margins. 


Theorem 5.1.2 (Sklar’s Theorem). Let H be a joint distribution function 
with margins F and G. Then there exists a copula C such that for all 
x,y e M, 

H(x,y) = C(F(x),G(y)) . (5.2) 


If F and G are continuous, then C is unique; otherwise, C is uniquely 
determined on Ran Fx Ran G, where Ran F is the range of F. Conversely, 
if C is a copula and F and G are distribution functions, then the function 
H defined by (5.2) is a joint distribution function with margins F and G. 

Theorem 5.1.3. For every copula C and every (u,v) G [0, l] 2 , 


W{u,v) < C(u,v ) < M(u,v) , 


(5.3) 


where W(x,y ) = max(x + y — 1,0) and M(x,y) = min(x, y) are called 
Frechet-Hoejfding lower and upper bounds, respectively. 


It is well known that the Frechet-Hoeffding lower and upper bounds are 
copulas in the two dimensional setting. For higher dimensions an analogon 
of (5.3) exists, however the lower bound is in general not a copula. 


Our main goal in this chapter is to provide bounds for 


l r i 


f(x, y)dC(x, y) . 


o Jo 


Thus, we are interested in bounds of the form 



fix, y) dC m ; n 



f(x, y)dC(x, y) < 



f(x,y)dC max (x,y), 

(5.4) 


for all C € C, where C m i n , C ma , x are copulas. 

A particularly interesting subclass of copulas for our problems are the so- 
called shuffles of M. 


Definition 5.1.4 (Shuffles of M ). Let n > 1, s = (sq, ..., s n ) be a par¬ 
tition of the unit interval with 0 = so < si < ... < s n = 1, a be a 
permutation of S n = {l,...,n} and u: S n —> {—1,1}. We define the 
partition t = (to,... ,t n ), 0 = to < t\ <...< t n = 1 such that each 
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[sj_i, Sj[x|to-(i)_ l5 is a square. A copula C is called shuffle of M with 
parameters {n,s,a,u} if it is defined in the following way: for all i £ 
{l,...,n} if u(i) = 1, then C distributes a mass of Si — Sj_i uniformly 
spread along the diagonal of [sj_i, Sj[x [t c .( i )_ 1 , t a ^[ and if oo{i) = —1 then 
C distributes a mass of Si — Sj_i uniformly spread along the antidiagonal of 

[®i— 1 j '-’i [ ^ \f<r{i)— 1) ^cr(i) [■ 

Note that the two Frechet-Hoeffding bounds W, Ad are trivial shuffles 
of M with parameters {1, (0,1), (1), —1} and {1, (0,1), (1), 1}, respectively. 
Furthermore, it is well-known that every copula can be approximated arbi¬ 
trarily close with respect to the supremum norm by a shuffle of M; see e.g. 
|64l Theorem 3.2.2]. In the sequel we denote by ir n the partition of the unit 
interval which consists of n intervals of equal length. 


In the next paragraph we want to give a description of the linear assign¬ 
ment problem and of the Hungarian algorithm. Then we will illustrate the 
close relation of (5.4) to linear assignment problems. 


Description of the linear assignment problem and of the Hungarian 
algorithm Assignment problems deal with the question of how to assign 
n items (jobs, students) to n other items (machines, tasks). Since there are 
in general many assignments possible, we are interested in the best suitable 
assignment for the problem under investigation. Therefore, we must state 
our goal by specifying an objective function. Given an n x n cost matrix 
A = (a,ij), where a^- measures the cost of assigning i to j, we ask for an 
assignment with minimum total cost, i.e., the objective function Y^= l a io(i) 
is to be minimized. The linear sum assignment problem (LSAP) can then 
be stated as 

n 

minVa^) , (5.5) 

i=l 

where u runs through all possible permutations on n elements. 

Half a century ago Harold W. Kuhn published a famous article [55} pre¬ 
senting the Hungarian algorithm, the first polynomial-time method for the 
assignment problem. 

We briefly describe how it works. The following algorithm finds an optimal 
assignment to a given n x n matrix. 

1. Subtract the smallest entry in each row from all the entries of its row 

2. Subtract the smallest entry in each column from all the entries of its 
column 

3. Draw lines through appropriate rows and columns so that all the zero 
entries of the cost matrix are covered and the minimum number of 
such lines is used 
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4. Test for Optimality: (z) If the minimal number of covering lines is n, 
an optimal assignment of zeros is possible and we are finished, (ii) 
If the minimum number of covering lines is less than n, an optimal 
assignment of zeros is not yet possible. In that case, proceed to Step 
5. 

5. Determine the smallest entry not covered by any line. Subtract this 
entry from each uncovered row, and then add it to each covered col¬ 
umn. Return to Step 3. 

Now we see how to apply this algorithm to our problem. 

Theorem 5.1.5. Letn >1,A = i,...,n be a real-valued nxn matrix 

and let the function f be defined as 


f(x,y):=aij, (x,y) £ 


i — 1 i 


n n 


j - 1 3 


n n 


Then the copula C max which maximizes 

f(x,y)dC(x,y) 


max / 

C £C J [o 5 i [ 2 


(5.6) 


is given as a shuffle of M with parameters {n, vr^, <7*, 1}, where a * is the 
permutation which solves the assignment problem 


max 

(rSP 


T. a i,cr{i)- 


1=1 


Moreover the maximal value of (5.6) is given by 


'[o,T 


f(x, 2 /)dC max (x, y) = ~Tj a i,cr*(i) 


i =1 


(5.7) 


Proof. Let {Ck(x, y), k = 1,..., n! = N} be the set of all shuffles of M with 
parameters of the form {n, 7r n , a^, 1} and let > 0, k = 1,..., N, where 
XfcLi tk = 1- Then C'(x, y ) = J2k=i tkC/fx, y) is always a copula satisfying 

/ f(x, y)dC'(x, y) < - V a i a *a), 

•Ao,i [ 2 

where a* is given in the statement of the theorem. 

For an arbitrary copula C € C we define the matrix Be as 


rial i 


. dC(x, y). 


n ’ n L L n ’ n L 


Bc{i,j) = n 
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It follows by Definition 5.1.1 that Be is doubly stochastic and by Definition 
5.1.4 that Bc k is a permutation matrix. Furthermore it follows from the 


Birkhoff-von Neumann Theorem that the set of doubly stochastic matrices 
coincides with the convex hull of the set of permutation matrices, see e.g. 
[63]. Thus for every C there exist t k > 0, k = 1,..., JV with Yl k =l = 1 
such that 

N 

= ^2t k B Ck (i,j), for every i,j , 
k =1 


and hence 


'[o,i [ 2 


N „ i n 

f(x, y)dC(x, y) =^4 / f{x, y)dC k (x, y) < - V u,wo . □ 

fc=l i =1 


Note that the maximal copula in Theorem 5.1.5 


is by no means unique, 

since for instance the value of the integral in (|5.6|) is independent of the 
choice of u. 


Obviously, we can derive a lower bound in Theorem 5.1.5 by considering 
g(x,y) = —f(x,y). Furthermore it is easy to see that Theorem 5.1.5 applies 
to all functions / which are constant on sets of the form 


Ii,j = [si,s i+ i[ x [tj,t j+ 1[, i = 0, ...,n - 1 , j = 0,... ,m - 1 , 

where 0 = so < si < ... < s n = 1 and 0 = to < t\ <...< t m = 1 are 
rational numbers. 


The following generalization of our approach applies to a wide class of 
functions on the unit square. 

Theorem 5.1.6. Let f be a continuous function on [0, l] 2 , let the sets /”■ 
be given as 


for every n > 1 and define the functions f ,f n as 

fjx, y) = min / (x, y ), for all {x, y) <E Iff,, 

7n( x iy) = max f(x,y), for all (x,y) E IP,. (5.8) 

Furthermore let C " aY be the copulas which maximize 

max / f (x,y)dC(x,y) and max / J n (x,y)dC(x,y), 

c ^ c J[ o,i[ 2 ~ n c eC J [oa[ 2 


i — 1 i 

2 n ’ 2 n 


:i - i j_ 
2 n ’ 2 n 


for i,j = 1, 
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respectively. Then 


./' (x, y)dC^ ax (x, y) < sup / f(x, y)dC{x, y) 

3,i[ 2 CeCJ[ o,ip 

< [ 7n( x i y)dC^ aax (x, y), (5.9) 

mip 


for every n, and 


/ f n {x,y)dC^ iax (x,y) = lim / f n (x,y)dC max {x,y) 

n ^°°J[0,l[ 2 n ^°°J[0,l[ 2 

= sup [ f(x,y)dC(x,y). (5.10) 

cec J[o,i[ 2 


Proof. The inequalit ies in (5.9) follow immediately from the construction of 
f n ,f n and Theorem 5.1.5| Furthermore since / is continuous on [0, l] 2 we 
have that for every e > 0 there exists an integer n such that 

\7n( x ,y) ~ / n 0U2/)l < e > for all (x, y) G [0, l] 2 . (5.11) 

Moreover it follows from Theorem |5.1.5 that for every n we can write 


2 n 

[ f n (x, y)dC n (x, y) = ^ 

J[ o,i [ 2 2 

for a permutation er* and a real valued matrix A = {ch,j}i,i=i,...,n with 

a* ,j = min / (x, y), for z, j = 1, . . . , T. 

(x,y)£iv-. 


Using (5.11), we get that 



f n ( x ,y)dC n ( x ,y) < 



( f n { x ,y) + e)dC n (x,y ) 


1 

2 n 


2 n 

^^( a i,a*(i ) Te) 
i=l 


and therefore 


[ fn( x ’y)<ic n ( x >y) - [ f n ( x ,y)dc n (x,y) 


< e. 


Combining this with (5.9), we get (5.10). 


□ 


The assumption that / is continuous can, perhaps, be relaxed to the case 
that / is C-continuous a.e. for all C € C. This is required to make sure that 



f(x,y)dC{x,y) 
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exists for all C E C. 

By defining the functions / , f n differently, we might get an approxima¬ 
tion technique which converges faster to the optimal value, for instance we 
could use 

' i 3 


fn(x,y) = f 


2 n ’ 2 n 


for all (x,y) E Ifj. 


Furthermore the mini- and maximization steps in (5.8) can be time-consu¬ 


ming, for instance when these problems are not explicitly solvable. However 
the advantage of the present approach lies in the fact that we get an upper 
and lower bound of the optimal value for every n, which is obviously useful 
for numerical applications. 


In numerical investigations where (5.8) could not be solved explicitly we 
used mini- and maximization over a fixed grid in each This results in a 
fast computation, however we obviously lose the property that we get upper 
and lower bounds for every n. 


By assuming Lipschitz-continuity of /, we can describe the rate of con¬ 
vergence of our method. 


Corollary 5.1.7. Let the assumptions of Theorem 5.1.6 hold and, in addi¬ 
tion assume that f is Lipschitz-continuous on [0, l] 2 with constant L. Then 


/ f n (x,y)dC n (x,y)- f(x,y)dC(x,y ) 

/[0,ip J[0,ip- 


< L 


V2 


Proof. Following the proof of Theorem 5.1.6 and using the Lipschitz-conti¬ 
nuity of / we get 


\fn(x,y) - f n (x,y)\ < L 



for all (x, y) E [0, l] 2 , 


and therefore 



f n {x,y)dC n (x,y) 



l n ( x ,y)dC n (x,y ) 



□ 


In this section we present two numerical examples in which we apply the 
approximation technique presented in Theorem 5.1.6 We use an implemen¬ 
tation of the Hungarian Algorithm in MatLab, which makes it possible to 
derive the solution of the linear assignment problem (5.5) for a given ma¬ 
trix A of size 2 10 x 2 10 within seconds. The involved mini- or maximization 
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of the integrand function on a given grid can be done efficiently, since the 
integrand functions are piecewise smooth. 


Now we derive upper bounds for (1.19) by maximizing g over the set of 

for functions / such that 


all copulas. This has already been done in 
^J^(x,y) has constant sign for all (x,y) G [0,1[ 2 . Note that this condition 

is equivalent to the two-increasing property of / provided that §-Jj~(x,y) 
exists on the unit square. 


As a numerical example, we consider 

1 


N 


limsup — y ' sin(7r(.T n + y n )). 
N^oo 


The numerical results are illustrated in Table |5.1[ Note that the approxi¬ 
mations of the lower bound can be easily computed using the symmetry of 
the sine function. 


A further interesting question concerns the sequences (x n ) n> i,(y n ) n> i 


which maximize (1.19). Let {x n ) n> i be a u.d. sequence and C(x,y ) a shuffle 


of M, then it is easy to see that (f(x n )) n> i is u.d., where / is the support 


of C. Thus if C is the shuffle of M which attains the maximum in (1.19), an 


optimal two-dimensional sequence is given as (x n , f(x n ))n> l, where (x n )n>i 
is an arbitrary u.d. sequence. In Figure |5.1j we present the support of the 
copula which attains the upper bound for the maximum in our approxima¬ 
tion when n = 7. 


We point out that in this context the support of a copula is meant as 
the support of the measure gc induced by the copula, i.e. 

Supp(C) = {B€ B([ 0, l] 2 ) : fi C (B) > 0} . 

Although we are not able to give a rigorous proof, by increasing n it 
seems that the copula C' which attains the maximum is the shuffle of M 
with parameters {2, (0,0.75,1), (1), {w(l) = — 1, w(2) = 1}}. In this case we 
have that 


supp(C') = (x,f(x)) : x G 


0 , 


,f(x ) = ^ - x } U { (x,x) : x G 


4’ 


and then 

rl /*! 


n sin(7 t(x+ y))dC'(x,y) = / sm(n(x + f'(x)))dx 

Jo 


ri r 1 

= / sin(7r(a; + 0.75 — x))dx + / sin(7r2a;)dx 

Jo J\ 

3 1 


4\/2 27t 


0.371175 . 
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n 

5 

6 

7 

8 

9 

10 

UB 

0.3933 

0.3824 

0.377 

0.3741 

0.3727 

0.3712 

LB 

0.3482 

0.3598 

0.3655 

0.3684 

0.3698 

0.3711 


Table 5.1: Upper and lower bounds for the maximum in (1.19) with respect 
to n. 



Figure 5.1: Support of copula which attains upper bound for sin(7r(A + Y)) 
and n = 7. 

A first-to-default swap (FTD) is a contract in which a protection seller 
(PS) insures a protection buyer (PB) against the loss caused by the first 
default event in a portfolio of risky assets. The PB pays regularly a fixed 
constant premium to the PS, the so-called spread, until the maturity T of 
the contract or the first default event, whichever occurs first. In exchange, 
the PS compensates the loss caused by the default at the time of default. 

We assume that the underlying portfolio consists of two risky assets, for 
which the marginal default distributions are known, but the joint distribu¬ 
tion is unknown. We want to derive a worst case bound in this setting. For 
the valuation of the FTD we follow the paper of Schmidt and Ward [83 j. 
Note that Monte Carlo methods for the evaluation of hrst-to-default swaps, 
where the dependences within the portfolio is modeled by a copula, are e.g. 
presented in Aistleitner et al. [3] and Packham and Schmidt m- 

Let ti,T 2 denote the random default times of the two risky assets, let 
the notional be equal to one for both assets and Ri,i = 1,2, be the so- 
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called recovery rates, which are the percental amounts of money that can be 
liquidized in case of the default of an asset. We assume that the distribution 
of Tj is given as 

Pfa < t) = 1 - e~ Xit , t > 0, 

where the intensity A* can be derived from the credit default swap market 
as 

A* = 


Si 

1 -Ri 


and Si is the premium of an insurance against the default of asset i. 

Now denote by t = min(ri, 72 ) the first default time in the portfolio, let 
0 = to < t\ <...< t n = T be the payment times of the constant spread and 
assume that there exists a risk free interest rate r > 0. Then, to guarantee a 
fair spread s, we obtain that the expected, discounted premium and default 
payments are equal, i.e. 


i=0 


,—rt-i 


(r > h) = 


J2^[^-Ri)e- rT l{r<TAr=r i} ]- 


i= 1 


By the above assumptions we obtain that 

P( r > ti) — I l{/(:r,Ai)>tj A f{y,\2)>ti}dC{x-, y), 

J[ 0,1[ 2 

2 

E [(1 - Ri) e ~ rTl {T<T A r=n}\ = 

i= 1 


e —,min mxMWyM)) (1 ~ R 1 ) 


A o,i[ : 

T ^'{/(l/i^ 2 )^min(/(x ! Ai),T)}(1 -^ 2 )^ dC(x, y), 

where f(x, A) = lQ gd x ) j s the inverse distribution function of an exponen¬ 
tial distribution with parameter A and 1/( T „vm denotes the characteristic 
function of set B C [0,1[ 2 . { 

Now we want to calculate the maximal spread s by maximizing over all 
copulas. We obtain for the spread that 


0 -rmin(/(x,Ai),/(j/,A 2 )) 


S = 


Vlp Si=0 e r<! l{/(x,Ai)>ti A f(yM)>U} 
l{/(x,Ai)<min(/( 2 /,A 2 ),T)}(l “ R\) 

T ^"{/(2/)^2)<™in(/(x,Ai),T)} (f -^2)^ d,C(x, y). 


(5.12) 
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Note that the value of the integral is finite since the first payment is made at 
to = 0. Furthermore note that the integrand function in this example is not 
continuous, thus Theorem 5.1.6 cannot be applied. Nevertheless it is clear 
that our technique provides upper and lower bounds for the optimal values, 
and since these bounds converge to each other our approach still works. 


In Table 5.2 we present numerical results for a concrete example with 
three payment times, ti = 0,1, 2. One can observe that the resulting copulas 
(given in Figures 5.2 and 5.3 for n = 7,8, respectively) are highly irregular 
in the left upper quarter of the unit square. However for n = 10 the upper 
and lower bounds for the optimal values are almost equal. 


Ai 

A 2 

Ri 

R 2 

T 

r 

L 


i 

3 

1 

2 

0.5 

0.7 

2 

0.05 

(0, 1, 2) 


n 

3 

4 

5 

6 

7 

8 

10 

UB 

0.3601 

0.3355 

0.3301 

0.326 

0.322 

0.3202 

0.3195 

LB 

0.2956 

0.3031 

0.314 

0.318 

0.3183 

0.3189 

0.3195 

UB 

0.1714 

0.1674 

0.1567 

0.1535 

0.1519 

0.1505 

0.1498 

LB 

0.1453 

0.1456 

0.1458 

0.1480 

0.1492 

0.1492 

0.1495 


Table 5.2: Approximation of the maximal spread of a FTD, where UB and 
LB and UB and LB denote the values of the upper and the lower bounds 
of the maximal and minimal value of the integral, and n the fineness of the 


approximation according to Theorem 5.1.6 



X 


Figure 5.2: Copula which attains the upper bound for the maximal value 
with n = 7. 

The method just presented can be used to derive sharp bounds for inte¬ 
grals of piecewise constant functions with respect to copulas. This extends 
the scientific literature on this topic, which is in general still open. 
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X 


Figure 5.3: Copula which attains the upper bound for the maximal value 
with n = 8. 

The numerical effectiveness of our method was illustrated in the two numer¬ 
ical examples coming from different branches of applied mathematics just 
proposed. 

However, as pointed out also in the previous chapters, an interesting 
problem concerns the extension of the method to the multidimensional case. 
This would be of particular interest since finding bounds for multidimen¬ 
sional integrals with respect to copulas has several applications in fields of 
mathematics such as number theory and actuarial mathematics. 
Unfortunately, at the moment this method fails to be optimal in higher di¬ 
mension, since the so-called multi-index assignment problems are in general 
NP-hard. We think that one possible way to tackle this kind of problems is 
to investigate them from a heuristic point of view. 
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